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METHOD OF MODULATING METABOLITE BIOSYNTHESIS IN RECOMBINANT CELLS 



FIELD OF THE INVENTION 

5 

The present invention relates to the field of modulating the biosynthesis of metabolites in 
cells. More particularly, the invention relates to the cloning and expression of transcription 
factor genes, such as genes coding for AP2 domain class transcription factors and the use 
of such transcription factor for regulating the expression of genes involved in the 
1 0 biosynthesis of metabolites or precursors therefor. Especially the invention relates to the 
cloning and expression of AP2 domain class plant transcription factors and use hereof for 
regulating the expression of biosynthetic genes involved in the secondary metabolite 
biosynthesis in plant cells. 



15 



-BACKGROUNDX>RTHE INVENTION^ND PRIOR ART- 



Generally, two basic types of metabolites are synthesised in cells, i.e. those referred to as 
primary metabolites and those referred to as secondary metabolites. A primary metabolite 
20 is any intermediate in, or product of the primary metabolism in cells. The primary 

metabolism in cells is the sum of metabolic activities that are common to most, if not all, 

li ving ce ll s a n d a r e necessa r y f oi basa l y i ow t h and m aintenance of the cells. Primary 

metabolism thus includes pathways for generally modifying and synthesising certain 
carbohydrates, proteins, fats and nucleic acids, with the compounds involved in the 
25 pathways being designated primary metabolites. 

In contrast hereto, secondary metabolites usually do not possess a basal function in cell 
growth and maintenance. They are a group of chemically very diverse products that often 
have a restricted taxonomic distribution. Secondary metabolites normally exist as members 
30 of closely related chemical families, usually of a molecular weight of less than 1,500, 
although some bacterial toxins are considerably larger. Two examples of fungal cell 
secondary metabolites are penicillin and ergotamine. 

Plant metabolites include a diverse array of chemically unrelated compounds such as 
35 carbohydrates and lipids (e.g. mono-, oligo- and polysaccharides, sugar alcohols, organic 
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acids, fatty acids and lipids, acetylenes and thiophenes), nitrogen-containing compounds 
(e.g. amino acids, amines, glycosides, glucosinolates, purines, pyrimidines and 
polypeptides) of which most, but not all, generally are referred to as primary metabolites. 
Accordingly, some compounds such as fatty acids, sugars and steroids may e.g. be 
5 categorised both as primary metabolites and secondary metabolites (see e.g. Dewick, 
P.M., 1997, Medicinal Natural Products A Biosynthetic Approach, John Wiley & Sons, 
Chichester). 

Secondary plant metabolites include e.g. alkaloid compounds (e.g. terpenoid indole 
10 alkaloids and indole alkaloids), phenolic compounds (e.g., quinones, lignans and 
fiavonoids), terpenoid compounds (e.g. monoterpenoids, iridoids, sesquiterpenoids, 
diterpenoids and triterpenoids). In addition, secondary metabolites include small molecules 
(i.e., having a molecular weight of less than 600), such as substituted heterocyclic 
compounds which may be monocyclic or polycyclic, fused or bridged. 

15 



Many plant secondary metabolites have value as pharmaceuti ca l s, food colours, flavours 
and fragrances. Plant pharmaceuticals include e.g.: taxol, digoxin, colchicine, codeine, 
morphine, quinine, shikonin, ajmalicine and vinblastine. Examples of secondary 
metabolites that are useful as food additives include anthocyanins, vanillin, and a wide 
20 variety of other fruit and vegetable flavours and texture modifying agents. 

Plant secondary metabolites such as terpenoid indole alkaloids (TIA) represent a class of 
pharmaceutical^ useful compounds which naturally occur in many plant species. 

25 Some plant secondary metabolites are linked to plant or plant cell defence mechanism and 
may e.g. confer to the plant antimicrobial activity, protection against UV light, herbivores, 
pathogens, insects and nematodes, and the ability to grow at low light intensity. 



There are numerous examples of the application of plant secondary metabolites such as 
30 TIA's in medicine. The monomeric alkaloids serpentine and ajmalicine found in 
Catharanthus roseus are e.g. used as tranquillisers and to reduce hypertension, 
respectively. The dimeric alkaloids vincristine and vinblastine, also found in C. roseus, are 
potent antitumor drugs. Camptothecin, a monomeric TIA found in Camptotheca acuminata, 
also possesses anti-tumor activity. Quinine from Cinchona officinalis is used in malaria 
35 treatment. 
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However, a major problem associated with the industrial use of the above metabolites is 
that only very small or variable amounts of these compounds are present in plants. The 
recovery of useful metabolites from their natural sources is thus in many instances difficult 
5 due to the enormous amounts of source material which may be required for the isolation of 
utilisable quantities of the desired products. As an example, over 500 kg of Catharanthus 
roseus is needed to obtain 1g of vincristine. Extraction is both costly and tedious, requiring 
large quantities of raw material and extensive use of chromatographic fractionation 
procedures. The low levels of these compounds in plants may also imply that many of the 
10 compounds are not detected when performing normal screening procedures, hence many 
unknown compounds may exist 

Biosynthesis of TIA compounds proceeds in most plants via many enzymatic steps. TIA 
compounds consist of an indole moiety provided by tryptamine and a terpenoid portion 

15 provided by the iridoid glucoside compound secologanin. Tryptamine is derived from 
primary metabolism by a single enzymatic conversion of the amino acid tryptophan, a 
reaction catalysed by the enzyme tryptophan decarboxylase (TDC) (EC 4.1.1.28). The 
biosynthesis of secologanin requires a number of enzymatic conversions of which the first 
step is the hydroxylation of geraniol by the enzyme geraniol 10-hydroxylase (G10H). 

20 Tryptamine and secologanin are condensed by the enzyme strictosidine synthase (STR) 
(EC 4.3.3.2) to form strictosidine, which is the general precursor of all terpenoid indole 
alkaloids found in plants. The first enzymatic conversion of strictosidine into strictosidine 
aglucon is catalysed by strictosidine-&-D-glucosidase (SG) (EC 3.2.1.105). Many different 
enzyme activities convert strictosidine aglucon into the large variety of terpenoid indole 

25 alkaloid end products. 

The best progress on molecular characterisation of the terpenoid indole alkaloid pathway 
has been made with C. roseus or Madagascar periwinkle, a member of the Apocynaceae 
family. C. roseus cells have the genetic potential to synthesise over a hundred terpenoid 

30 indole alkaloids. The biosynthesis of terpenoid indole alkaloids is strongly regulated, and 
depends on plant cell type and environmental conditions. Their biosynthesis is e.g. induced 
by fungal elicitors (Moreno et al., 1995), jasmonates (Gantet et al., 1998) and auxin 
starvation (Gantet et al., 1998). Many monomelic TIA compounds are found in all plant 
organs, but vindoline and vindoline-derived dimeric alkaloids are only found in chloroplast- 

35 containing plant tissues (Meijer et al. , 1 993). 
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Until now, only a limited number of genes coding for enzymes involved in terpenoid indole 
alkaloid biosynthesis have been cloned, such as cDNA clones encoding STR and TDC 
isolated from C. roseus. In addition, cDNA clones for NADPH: cytochrome P450 reductase 
5 (CPR) (Meijer et al., 1993), which is essential for the G10H-catalysed reaction, and SG 
have been isolated (Geerlings, Memelink, van der Heijden, and Verpoorte, unpublished 
results). 

Gene expression studies by the present inventors have shown that the regulation of 
10 terpenoid indole alkaloid biosynthesis is controlled largely, if not uniquely, at the level of the 
expression of biosynthetic genes. Accordingly, analysis of the expression of the terpenoid 
indole alkaloid biosynthetic genes Tdc and Strt showed that their expression is low, 
especially in cell cultures. It was found, that the level of gene expression is likely to be 
limiting for alkaloid production. Overexpression of a single biosynthetic gene {Tdc or Sf/1) 
15 in transgenic C. roseus cells resulted in significantly elevated levels of the corresponding 
enzyme activity. However, this did not result in elevated alkaloid levels, presumably 
because many other enzymes are limiting and would need to be overexpressed. 

These studies have further demonstrated what is already known, that the genes are 
20 coordinately regulated depending on cell type or environmental conditions. Strt and Tdc 

mRNA accumulate in suspension-cultured cells after auxin starvation (Pasquali et al., 

1992) or phosphate starvation, exposure to fungal elicitors (Pasquali et al., 1992) or 
(methyl) jasmonate (Menke et al., 1999) and their distribution in the plant is 
developmentally regulated with the highest levels in the roots (Pasquali et al., 1992). In 
25 leaves Tdc and Sf/1 are induced by a UV-B light pulse (Ouwerkerk and Memelink, 

unpublished results). Cpr mRNA accumulation is rapidly induced by fungal elicitor (Meijer 
etal., 1993). 

The observations that mRNA from genes involved in the biosynthesis of TIA compounds 
30 such as Tdc and Strt mRNAs coordinately accumulate in response to fungal elicitors, 
(methyl)-jasmonate t UV light, auxin depletion, phosphate depletion, and have similar 
tissue-specific distributions, have led the present inventors to hypothesise that the Tdc and 
Sfrl genes might be controlled by one or more common regulating factor(s) or 
substance(s). It was further hypothesised that among possible regulating factors, 
35 transcription factors could have such a regulating effect. 



22209DK1 t/FLV/HS/05-02-99 



It is known that certain transcription factors can regulate complex cell differentiation 
processes in animals involving numerous target genes. A notable example is muscle 
differentiation, where either one of a set of myogenic bHLH transcription factors (MyoD, 
5 myogenin, Myf5, MRF4) in combination with the MADS-domain transcription factor MEF2 
induces muscle cell differentiation and switches on numerous muscle-specific genes. Other 
examples include homeodomain transcription factors in the fruit fly that regulateeell 
processes resulting in the determination of segment identity. 

The pathway that is most extensively studied in plant secondary metabolism at the 
transcriptional level, is the one leading to the formation of the anthocyanin pigments. The 
genes encoding flavonoid biosynthetic enzymes are controlled by a combination of two 
distinct transcription factor species, one of which has homology to the protein encoded by 
the vertebrate proto-oncogene c-Myb t and the other with the vertebrate bHLH protein 
encoded by the proto-oncogene c-Myc. These transcription factors bind to specific 
sequences in the promoters of the target genes (Martin and Paz-Ares, 1997). 

The DNA-binding domain of plant MYB proteins consists of two, or for some of them, one 
imperfect repeat(s) (Martin and Paz-Ares, 1997). The MYC proteins have a bHLH-type 
(basic helix-loop-helix) DNA-binding domain and recognisevariants of the sequence 
CANNTG. About ten enzymes are involved in the biosynthesis of anthocyanins starting 
from phenylalanine. In maize, the entire set of genes encoding these enzymes are thought 
to be regulated coordinated by the Myc gene R and the Myb gene C1 in the aleurone 
(epidermal layer of the kernel endosperm), and by homologous genes in other parts of the 
25 plant (Holton and Cornish, 1995). 

Overexpression of the maize Lc gene encoding a MYC-type regulatory protein in Petunia 
upregulated the whole flavonoid biosynthetic pathway starting from Chs and including the 
earlier and later genes, resulting among others in intensely pigmented leaves (Bradley et 
30 a!. t 1998). The expression of the general phenylpropanoid genes Pat and C4h was not 
affected by Lc overexpression, indicating that Lc only regulates structural genes in the 
flavonoid branch. 

Introduction of the maize R and C1 MYC-type regulators in Arabidopsis intensified 
35 pigmentation in normally pigmented tissues and induced pigmentation in plant tissues that 



22209DK1 1/FLV/HS/05-O2-99 



6 

are normally unpigmented (Lloyd et al., 1992). In maize cell suspension, ectopic 
expression of C1 and R led to the accumulation of anthocyanins and a number of other 
related 3-hydroxy fiavonoids (Grotewold et al. t 1998). In addition, six anthocyanin structural 
genes that are targets for C1IR were expressed at high levels in the transgenic cell line. 

5 

In the plant Arabidopsis thaliana, it has been shown that overexpression of the transcription 
factor CBF-1 that belongs to the AP2 domain class transcription factors resulted in 
coordinate upregulation of a set of cold-regulated genes (Jaglo-Ottosen et al., 1998). 
However, there are no suggestions that transcription factors of this class may have an 
10 effect on the biosynthesis of metabolites in plant cells and other cells. 



fn 



The present inventors have now discovered that transcription factors having an AP2 DNA- 
binding domain are highly useful as central regulators of complex metabolite pathways 
involving numerous target genes for such transcription factors. This discovery has opened 
15 up for providing novel effective means of generating novel metabolite compounds, 

significantly enhancing the yield of commercially valuable metabolite compounds and also 
for enhancing the tolerance of plants towards exogenous stress factors and conditions. 



20 SUMMARY OF THE INVENTION 



Accordingly, the invention relates in one aspect to a method of modulating in a cell the 
expression of one or more genes involved in the biosynthesis of a metabolite or a 
precursor herefor, the method comprising inserting into the cell a nucleotide sequence 

25 coding for a transcription factor other than a transcription factor having a bHLH-type or a 
MYB-type DNA-binding domain, the nucleotide sequence is operably linked to at least one 
expression regulating sequence, and/or modifying the expression of a nucleotide sequence 
coding for such a transcription factor already present in the cell, the transcription factor is 
capable of regulating the expression of at least one gene coding for a gene product 

30 involved in the biosynthesis of said metabolite or precursor, and subjecting the cell to 
conditions where the inserted nucleotide sequence coding for a transcription factor is 
expressed or the expression of the nucleotide sequence already present in the cell is 
modulated. 
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In a further aspect the invention relates to a method of modulating the stress resistance of 
a cell, the method comprising inserting into the cell a nucleotide sequence coding for a 
transcription factor, the nucleotide sequence is operably linked to at least one expression 
regulating sequence, and/or modifying the expression of a nucleotide sequence coding for 
5 such a transcription factor already present in the cell, the transcription factor is capable of 
regulating the expression of at least one gene coding for a gene product involved in the 
biosynthesis of said metabolite or precursor, and subjecting the cell to conditions where the 
inserted nucleotide sequence coding for a transcription factor is expressed or the 
expression of the nucleotide sequence already present in the cell is modulated, the 
10 expression of said nucleotide sequence resulting in a modified responsiveness of the cell 
towards exogenous stress conditions. 

There is also provided a recombinant cell having, relative to its parent cell, enhanced or 
reduced biosynthesis of a metabolite or a precursor therefor, and/or enhanced or reduced 

1 5 expression of a gene product involved in metabolite production, the ceil comprising a 
nucleotide sequence coding for a transcription factor other than a transcription factor 
having a bHLH-type or a MYB-type DNA-binding domain, said transcription factor is 
capable of regulating the expression of at least one gene coding for a gene product 
involved in the metabolite production, said sequence is inserted into the cell and/or its 

20 expression is modified by operably linking it to a regulating sequence with which it is not 
natively associated. 



The invention pertains in a still further aspect to a method of producing a metabolite 
including a plant secondary metabolite, the method comprising providing a recombinant 
25 cell according to the invention, cultivating the cell under conditions where the nucleotide 
sequence coding for the transcription factor regulating the expression of at least one gene 
coding for a gene product involved in the metabolite production is expressed, and 
recovering the metabolite. 

30 There is also provided a method of constructing a recombinant cell according to the 
invention, the method comprising the steps of (i) identifying in a source cell a nucleotide 
sequence coding for a transcription factor other than a transcription factor having a bHLH- 
type or a MYB-type DNA-binding domain, said transcription factor is capable of regulating 
in the source cell the expression of at least one gene coding for a gene product involved in 

35 biosynthesis of a metabolite or a precursor therefor, (ii) isolating said nucleotide sequence 
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and (iii) inserting said isolated nucleotide sequence into a host cell comprising a gene 
coding for a gene product involved in biosynthesis of a metabolite or a precursor therefor, 
the insertion of said nucleotide sequence resulting in that the expression of the gene of the 
host cell is modulated relative to the expression level in a host cell not comprising the 
inserted nucleotide sequence. 



In still further aspects there are provided an isolated nucleic acid molecule comprising a 
nucleotide sequence coding for an amino acid sequence selected from the group 
consisting of SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7, a polypeptide which includes 
10 an amino acid sequence having at least 40% homology including at least 50% t such as at 
least 60%, at least 70%, at least 80% and at least 90% to an amino acid sequence 
selected from the group consisting of SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7 and a 
polypeptide having the amino acid sequence shown in SEQ ID NO:5, SEQ ID NO:6 or SEQ 
ID NO:7. 

15 

In other aspects, the invention pertains to a vector comprising a nucleic acid molecule 
according to the invention, a host cell comprising such a vector, and a method of isolating a 
nucleotide sequence coding for a transcription factor not having a bHLH-type or a MYB- 
type DNA-binding domain, said transcription factor is capable of regulating in a cell the 

20 expression of at least one gene coding for a gene product involved in biosynthesis of a 
metabolite or a precursor therefor, the method comprising the steps of (i) probing a cDNA 
or genomic library with a probe comprising at least a fragment of nucleic acid according to 
claim 1 17 or 1 18; (ii) identifying a DNA clone that hybridises with said nucleic acid; and 
(iii) isolating the DNA clone identified in step (ii) wherein the nucleic acid sequence is 

25 coding for all or a part of said transcription factor, a method of isolating a nucleotide 
sequence coding for a transcription factor not having a bHLH-type or a MYB-type DNA- 
binding domain, said transcription factor is capable of regulating in a cell the expression of 
at least one gene coding for a gene product involved in biosynthesis of a metabolite or a 
precursor therefor, the method comprising the steps of (i) synthesising an oligonucleotide 

30 primer set corresponding to at least a fragment of a sequence selected from the group 
consisting of SEQ ID NO:1, 2, 3 and 4; and (ii) amplifying cDNA or genomic DNA using 
said primer set in a polymerase chain reaction wherein the amplified nucleic acid fragment 
is coding for all or a part of said transcription factor to obtain a nucleotide sequence coding 
for a transcription factor not having a bHLH-type or a MYB-type DNA-binding domain. 

35 
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The invention also pertains to an isolated nucleic acid which is a nucleotide sequence as 
shown in SEQ ID NO: 8 and a mutant, allele or variant hereof and the use of this isolated 
nucleic acid sequence for isolating a nucleotide sequence coding for a transcription factor 
not having a bHLH-type or a MYB-type DNA-binding domain, said transcription factor is 
capable of regulating in a cell the expression of at least one gene coding for a gene 
product involved in biosynthesis of a metabolite or a precursor therefor. 



DETAILED DISCLOSURE OF THE INVENTION 



10 



A major objective of the present invention is to provide a method of modulating in a cell the 
expression of one or more genes involved in the biosynthesis of a metabolite or a 
precursor for the metabolite. As mentioned above, the method comprises that a nucleotide 
sequence coding for a transcription factor other than a transcription factor having a bHLH- 
15 type or a MYB-type DNA-binding domain is inserted into the cell such that the nucleotide 
sequence is operably linked to at least one expression regulating sequence, and/or that the 
expression of a nucleotide sequence coding for such a transcription factor already present 
in the cell is modified. 

20 In the present context, the term "modulating" refers to any change in the level of 

expression of a gene involved in the biosynthesis of a metabolite or a precursor for the 
metabolite relative to the level of expression in a corresponding cell type not having a 
modulated expression of the same gene or genes, i.e. in the wild type state. It will be 
appreciated that "modulating" may imply both reduced and enhanced levels of expression 

25 relative to that found in the wild type cell. 

As used herein the term "cell" encompasses any eukaryotic or prokaryotic cell. Thus, cells 
include bacterial cells, fungal cells including yeast cells, animal cells, dictyostelium cells, 
algae cells, and plant cells. The cell may be in the form of a single, isolated cell or it may in 
30 the form of multicellular tissues including an animal or a plant organ, plant callus tissues 
and an entire animal or plant. 



The expression "nucleotide sequence coding for a transcription factor" is used herein 
encompasses any nucleotide sequence coding for a transcription factor which does not 
35 comprise a bHLH-type or a MYB-type DNA-binding domain. It will be appreciated that such 
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a coding sequence may consist solely of coding DNA (ORFs) or, when it is derived from a 
eukaryotic cell, it may include introns. 

In this context, the expression "transcription factor" refers to a protein that in a sequence- 
5 specific manner binds to DNA by recognising specific elements located in the promoter 
and/or enhancer regions of the corresponding genes. The binding of such transcription 
factors to the specific DNA elements modulates the activity of other components of the 
transcription machinery including basal transcription factors and RNA polymerase, and 
thereby positively or negatively modulates the initiation of mRNA synthesis. Transcription 
10 factors have in common that they comprise DNA binding domains capable of binding to 
specific elements of their target genes. 

Transcription factors are classified in families based on conserved features of their DNA- 
binding domain. Wrthin each family, DNA-binding domains are similar, but not identical. 

15 Small differences in amino acid sequences can cause significant differences in the DNA 
sequence requirement for binding. As a result of these differences within a family, it is 
impossible to reliably predict whether a certain DNA sequence will bind a certain 
transcription factor family member with high affinity, and conversely, based on the amino 
acid sequence of a certain transcription factor family member it is impossible to reliably 

20 predict which sequence it will bind with high affinity. 

Transcription factors have a modular structure, consisting of domains with specialised 
functions, such as sequence-specific DNA binding, transcriptional activation, interaction 
with themselves or other proteins. These domains are in many cases functional even when 

25 fused to heterologous proteins. ORCA proteins contain the AP2-type DNA-binding domain. 
Since they activate transcription, it can be concluded that they contain an activation 
domain. It is conceivable that in e.g. Catharanthus cells the activity of AP2 domain class 
transcription factors is modulated via post-translational modifications (phosphorylation, 
myristoylation etc.) via interaction with other proteins, or combinations thereof. The activity 

30 of AP2 domain class transcription factors can be modulated using recombinant DNA 
techniques. The objectives could be to make the activity of the transcription factors 
independent from post-translational modifications, to enhance their activity, or to make the 
activity dependent on convenient inducing conditions. These objectives can be 
accomplished via single amino acid modifications, deletions of several amino acids, fusions 

35 with other proteins or protein domains, or combinations thereof. AP2 domain-containing 
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transcription factors may be constitutively targeted to the nucleus by addition of a nuclear 
localisation signal. Strong activation domains could be fused to the transcription factors in 
addition to or as a replacement for existing activation domains. Such activation domains 
include the Herpes simplex virus VP16 domain, the GAL4 activation domain, or synthetic 
5 activation domains known to be active in plants. Inducibility of transcription factor activity 
can be achieved via fusion with mammalian steroid hormone receptor domains. The 
resulting chimera can be activated by steroid hormone addition. 

In accordance with the above method of the invention, the nucleotide sequence coding for 
1 0 the transcription factor is operably linked to at least one expression regulating sequence. In 
the present context, the term "operably linked" refers in general to a situation where a first 
nucleic acid sequence is linked with a second nucleic acid sequence and the first nucleic 
acid sequence is placed in a functional relationship with the second nucleic acid sequence. 
For instance, a promoter is operably linked to a coding sequence if the promoter affects the 
15 transcription or expression of the coding sequence (i.e. that the coding sequence is under 
the transcriptional control of the promotor). Generally, operably linked nucleic acid 
sequences are contiguous and, where necessary to join two protein coding regions, in 
reading frame. However, operably linked nucleic acid sequences may also be in a 
functional relationship without being contiguous. Typical examples of "expression 
20 regulating sequences" include promoter sequences, enhancer sequences, translation 
leader sequences, sequences coding for anti sense RNA and 3* non-coding sequences. 

It will be understood that the term "promoter" includes both constitutive promoters and 
regulatable promoters. In respect of the latter type of promoters, the regulation is typically 
25 conferred by exogenous (extrinsic) or endogenous (intrinsic) factors including physical 
factors such as the temperature or light, chemical factors including the pH, the presence of 
inducer substances, absence/presence of nutrient compounds in the cell environment. 

In one presently preferred embodiment, the cell in which the expression of one or more 
30 genes involved in the biosynthesis of a metabolite or a precursor herefor is a plant cell. 

The term "plant cell" as used herein include protoplasts, fused cells, gamete producing 
cells, undifferentiated and differentiated cells, and cells which regenerate into whole plants. 
Accordingly, a seed comprising multiple plant cells capable of regenerating into a whole 
35 plant, is included in the definition of "plant cell". In useful embodiments, the plant cell is in 
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the form of a cell culture. As used herein, the term "plant" refers to any organism of the 
kingdom Plantae, including Bryophytes, Pteridophytes, Spermatophytes (Gymnosperms 
and Angiosperms, including monocotyledonous and dicotyledonous), thallophytes 
(including algae such as Chrysophyta, Pyrrophyta, Euglenophyta, Rhodophyta, 
5 Phaeophyta and Clorophyta), a whole plant, any plant part or propagule thereof, seed or 
hybrid progeny and descendants, a plant cell, a multiplicity of plant ceils such as e.g. plant 
tissues, and plant cells in the form of a cell culture. Plantlets are also included within the 
meaning of "plant". 

10 In specific embodiments, the method relates to plant cells of a species of the Gentianales 
order or the Cornafes order such as a species selected from an indole alkaloid producing 
species. Such species include species of the Apocynaceae, Alangiaceae, Loganiaceae, 
Icacinaceae, Cornaceae and Rubiaceae families. One particularly interesting genus of the 
Apocynaceae family is the genus Catharanthus including the species C. roseus, C. 

15 coriaceus, C. lanceus, C. longifolius, C. ovalis, C. pusillus, C. scitulus and C, trichophyltus. 

In accordance with the invention, the nucleotide sequence coding for a transcription factor 
as defined above can be derived from any cell type comprising such a sequence. 
Particularly useful sources for such sequences are plant cells of any of the above types 
20 and forms. 

In one interesting embodiment, the nucleotide sequence coding for a transcription factor is 
a sequence coding for a transcription factor comprising at least one AP2 domain. 
Transcription factors comprising such a DNA-binding domain are generally referred to as 
25 AP2/EREBP transcription factors which constitute a class of transcription factors that at 
present have only been found in plants. They posses two features that are typical for 
transcription factors, namely sequencers pecific DNA binding capability and ability to 
activate (or repress) transcription. (Riechmann and Meyerowitz, 1998). 



30 Sequence similarity among the AP2/EREBP proteins is mostly limited to the AP2 DNA- 
binding domain (APETALA2 domain) itself (approximately 68 aa), which is the only region 
conserved among all proteins of the family. The distinguishing characteristic of proteins of 
the AP2/EREBP family is that they contain either one or two AP2 domains. Thus this family 
can be divided into two subfamilies based on whether the proteins contain one or two AP2 

35 domains, the EREBP subfamily and the AP2 subfamily, respectively. The AP2 domains in 
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the AP2 subfamily are more related among members of this subfamily than to the AP2 
domains of proteins of the EREBP subfamily and vice versa. However, many 
characteristics are common to the AP2 domains from both subfamilies. Two conserved 
segments are found within each AP2 domain, which have been referred to as the YRG 
5 element, and the RAYD element. The amino terminal part of the AP2 domains (the YRG 
element) is basic and hydrophilic. The carboxy terminal RAYD element contains a central 
region that, in almost all AP2 domains, is predicted to adopt the configuration of an alpha- 
helix of amphipathic character. Solution structure determination of one particular 
Arabidopsis AP2 domain protein revealed that the YRG element and the next 20 to 30 
10 amino acids form three anti-parallel p-sheets that contact the DNA, whereas the RAYD 
element indeed adopts an ct-helical structure as predicted (Allen et al, 1998). 

As used herein an AP2 DNA-binding domain is defined as an amino acid sequence having 
at least 35% amino acid residue identity such as at least 45%, at least 55%, at least 65%, 
1 5 at least 75%, at least 85%, at least 95% or at least 98% residue identity to any of the AP2 
domains described in Riechmann and Meyerowitz (1998). However, in the present context, 
any amino acid sequence having similarity to any of the two conserved segments, the YRG 
element and the RAYD element as defined by Riechmann and Meyerowitz (1998) are 
encompassed by the invention and considered as having an AP2 DNA-binding domain. 

20 

In another embodiment, the transcription factor as used herein includes at least two AP2 
domains such as at least three AP2 domains: 

In specific embodiments, the method is based on the use of a transcription factor coding 
25 sequence comprising at least one sequence coding for an AP2 domain selected from AP2 
domains of SEQ ID NO:5, SEQ ID IMO:6 or SEQ ID NO:7 or a sequence coding for an 
amino acid sequence selected from SEQ ID NO:5, SEQ ID NO:6 or SEQ ID NO:7. 



It will be appreciated that in accordance with the invention, a functional AP2 domain class 
30 transcription factor can be derived from a source cell naturally containing such a factor. 
However, it is contemplated that a transcription factor that is functional in accordance with 
the invention can be constructed either by isolating one or more AP2 domains from a 
naturally occurring transcription factor and combining it with an amino acid sequence with 
which it is not naturally associated. Alternatively, an AP2 domain may be constructed 
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synthetically and combined with further amino acids to provide a transcription factor active 
polypeptide. 

In accordance with the method of the invention, the nucleotide sequence may be a 
5 sequence coding for an amino acid sequence comprising an at least one AP2 domain 
having at least 40% homology, at least 50%, at least 60%, at least 70%, at least 80% or at 
least 90% homology to any of the at least one AP2 domains of SEQ ID NO:5, SEQ ID 
NO:6 and SEQ ID NO:7. 

10 As used herein, the term "homology" refers to homology at the amino acid level both in 
terms of amino acid identity and similarity. Similarity allows for "conservative variation", i.e. 
substitution of one hydrophobic residue such as isoieucine, valine, leucine or methionine 
with another hydrophobic residue, or the substitution of one polar residue with another 
polar residue, such as arginine with lysine, glutamic acid with aspartic acid, or glutamine 

15 with asparagine. In the present context, similarity is defined and determined using the 
present version of the BLAST program (Altschul et al f 1997). 

In other specific embodiments, the nucleotide sequence coding for the transcription factor 
comprises at least one AP2 domain-coding sequence selected from SEQ ID NO:1, SEQ ID 
20 NO:2, SEQ ID NO:3 or SEQ ID NO:4 or the nucleotide sequence coding for the at least 
one AP2 domain is a mutant, allele, derivative or variant of a nucleotide sequence coding 
for an AP2 domain of a sequence selected SEQ ID NO:1 , SEQ ID NO:2, SfcU IU NO:3 or 
SEQ ID NO:4. 

25 In the present context, the terms "mutant", "allele", "derivative" or "variant" may be by one 
or more of addition, insertion, deletion or substitution of one or more nucleotides in the 
nucleotide sequence, leading to the addition, insertion, deletion or substitution of one or 
more amino acids in the encoded polypeptide while substantially maintaining biological 
activity of the encoded polypeptide. Also included are changes in the nucleotide sequences 

30 which make no difference to the encoded amino acid sequence. 



Variant, allelic, mutant or derivatised nucleotide sequences can e.g. be produced by 
standard DMA mutagenesis techniques or by chemically synthesising the variant DNA 
molecule. Such variants, alleles, mutants or derivatives do not change the reading frame of 
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the protein-coding region of the nucleotide sequence and encode a protein having no or 
only minor changes in its biological function. 

In accordance with the method of the invention, the nucleotide sequence coding for a 
transcription factor, the expression of which is modulated, is either a homologous 
nucleotide sequence or a heterologous nucleotide sequence. As used herein, the term 
"homologous" refers generally to a nucleotide sequence present or originating in the same 
species and the term "heterologous" refers correspondingly to a sequence originating from 
a different species. 



10 



It will be appreciated that when the sequence is a homologous sequence, the expression of 
the sequence may be modulated while it is in its natural location in the cell, but it is also 
contemplated that a homologous coding nucleotide sequence may be translocated within 
the cell of origin. Whether homologous or heterologous, the transcription factor-encoding 
15 nucleotide sequence may be operably linked to a regulating nucleotide sequence with 
which it is not natively associated. 

In one step of the above method, a nucleotide sequence coding for a transcription factor is 
inserted into the cell. The insertion can be carried out using any conventional methods of 

20 inserting nucleic acid into cells. Such transformation methods include as examples 

microprojectile bombardment, microinjection, electroporation, liposome mediated uptake 

a nd tran sformation-by-means o f a disarmed Tt-plasm id v e ctor carri e d by Agrob a ct e rium . 



Numerous plant transformation vectors are available for plant transformation, and genes 
encoding transcription factors according to the invention can be used in conjunction with 

25 any such vectors. The selection of vectors for use will depend upon the preferred 
transformation technique and the target species for transformation. For certain target 
species, different antibiotic or herbicide selection markers may be preferred. Selection 
markers used routinely in transformation include the nptll gene which confers resistance to 
kanamycin and related antibiotics the bar gene which confers resistance to the herbicide 

30 phosphinothricin, the hph gene which confers resistance to the antibiotic hygromycin, the 
dhfrgene which confers resistance to methotrexate and the Tdc gene, which confers 
resistance to 4-methyltryptophan. 



Also many vectors are available for plant transformation using Agrobacterium tumefaciens, 
35 These typically carry at least one T-DNA border sequence and include vectors such as 
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P BIN19, pCAMBIA, pMOG402 and P MOG22. Typical vectors contain DNA borders for 
/\grooacterium-mediated transformation, the RK2-derived tra function for mobilisation 
between £. coli and other hosts, and the OriT and OriV functions that are also from RK2. 
They usually contain a polylinker that is suitable for the cloning of plant expression 
5 cassettes containing their own regulatory signals. 

Transformation without the use of Agrobacterium tumefaciens can also be performed which 
circumvents the requirement for T-DNA sequences in the chosen transformation vector, 
and consequently vectors lacking these sequences can be utilised in addition. The choice 
1 0 of vector depends largely on the preferred selection for the species being transformed. 

In accordance with the invention, gene sequences intended for expression in transgenic 
plants can be assembled in expression cassettes behind a suitable promoter and upstream 
of a suitable transcription terminator. These plant expression cassettes can then by use of 
1 5 standard methods be transferred to the plant transformation vectors described above. 

The selection of a promoter used in expression cassettes will determine the spatial and 
temporal expression pattern of the transgene in the transgenic plant. Selected promoters 
will express transgenes in specific cell types (such as leaf epidermal cells, mesophyll cells, 

20 root cortex cells) or in specific tissues or organs (roots, leaves or flowers, for example) and 
this selection will reflect the desired location of expression of the transgen e. Alternatively, 
the selected promoter may drive expression of the gene under a light-induced or other 
temporally regulated promoter. A further alternative is that the selected promoter be 
chemically regulated. This provides the possibility of inducing expression of the transgene 

25 only when desired and caused by treatment with a chemical inducer. 

A variety of transcriptional terminators are available for use in expression cassettes. They 
are responsible for the termination of transcription beyond the transgene and its correct 
polyadenylation. Appropriate transcriptional terminators are those which are known to 
30 function in plants and include the CAMV 35S terminator, the tml terminator, the nopaline 
synthase terminator, and the pea rbcs E9 and 3C terminators. They can be used in both 
monocotyledons and dicotyledons. 

It is a further objective of the present invention to provide the means of enhancing 
35 biosynthesis of metabolites in a cell as defined above. Although it is contemplated that the 
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discoveries on which the present invention is based is applicable to any of these cell types 
and to any metabolite synth sised in such cells by genes the expression of which are 
regulated by transcription factors as defined herein, the method of the invention is 
presently preferred for the optimisation of biosynthesis of plant metabolites, in particular 
5 secondary metabolites including those mentioned above. However, It is also contemplated 
that the method of the invention can be used for the biosynthesis of novel and presently 
unknown metabolites, including novel plant secondary metabolites. 

Thus, the secondary metabolites the biosynthesis of which can be enhanced using the 
10 method of the invention include alkaloid compounds including indole alkaloids such as 
terpenoid indole alkaloids (TIAs), phenolic compounds and terpenoid compounds. A review 
of secondary plant metabolites is included in e.g. Phytochemical Dictionary, A Handbook of 
Bioactive Compounds from Plants, edited by Harborne and Baxter, Taylor & Francis, 
London and Washington, DC. In useful embodiments, the secondary metabolites are 
1 5 alkaloids derived from compounds such as tryptophan (indole alkaloids including vincristine 
and vinblastine), tyrosine (isoquinoline alkaloids including berberine, berbamine and 
stephamine), lysine (quinolizidine alkaloids including lupamine and sparteine), ornithine 
(tropane alkaloids including scopolamine and atropine), nicotinic acid, anthranilic acid and 
acetate. 

20 

In accordance with the invention the transcription factor as used in the above method is 
capable of regulating the expression of at least one gene coding for a gene product 
involved in metabolite biosynthesis. Typically, such a gene product is a protein such as an 
enzyme. In a useful embodiments the enzyme is involved in an alkaloid biosynthetic 
25 pathway including an enzyme that is selected from tryptophan decarboxylase (TDC), 

geraniol-10-hydroxylase (G10H), strictosidine synthase (STR) or strictosidine glucosidase 
(SG). 

As mentioned above, the present invention has made it possible to modulate in a cell the 
30 expression of one or more genes involved in the biosynthesis of a metabolite or a 

precursor for the metabolite. Although in presently preferred embodiments the expression 
of the gene or genes is enhanced with the objective of improving the yield of a desired 
metabolite, it is also contemplated that it may, under certain conditions, be advantageous 
to be able to reduce the expression of one or more biosynthetic genes. The enhancement, 
35 or alternatively, the reduction of the gene expression, relative to a cell into which a 
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nucleotide sequence coding for a transcription factor is not inserted or which is without 
modified expression of the air ady present nucleotide sequ nc coding for a transcription 
factor, is obtained by modifying the expression level of the nucleotide sequence coding for 
the transcription factor as defined herein. 

5 

In a specific embodiment, the expression in the cell of one or more genes involved in the 
biosynthesis of a metabolite or a precursor herefor is enhanced by at least 1 0%, at least 
25%, at least 50%, at least 75% or at least 100%, relative to a cell into which a nucleotide 
sequence coding for a transcription factor is not inserted or which is without modified 
1 0 expression of the already present nucleotide sequence coding for a transcription factor. It 
will be understood that "enhancement of gene expression" is related to the increase in 
biosynthesis of the metabolite. It is also contemplated that the enhancement of gene 
expression can be higher such as by a factor of at least 1.5 including at least 2, at least 5 
or at least 10. 

15 

The enhancement of the expression of one or more genes involved in the biosynthesis of a 
metabolite or a precursor herefor can, in accordance with the invention, be achieved in 
either of two ways. One way is by inserting into the cell a nucleotide sequence coding for a 
transcription factor as defined herein and operably linking it to at least one expression 

20 regulating sequence, the cell comprising at least one target biosynthetic gene for the 

inserted transcription factor. An alternative way is by modifying the expression of a 

nucleotide sequence coding for such a transcription factor already present in the cell. In 
either of such ways, the modification results in that the transcription factor is produced in 
higher amounts than in a cell of a corresponding type that is not modified by insertion of a 

25 transcription factor-encoding sequence or by modifying the expression of such a sequence 
already present in the cell. It is evident that the expression of the biosynthetic gene is 
enhanced in a cell type not previously containing the transcription factor provided the cell 
contains a biosynthetic gene comprising a target sequence for the transcription factor. 
However, the expression level of an inserted transcription factor-encoding nucleotide 

30 sequence can also be enhanced by in vivo or in vitro modifications. 

In accordance with the invention, enhancement of the expression of a transcription factor 
encoding sequence ("upregulation") can be obtained by any known technique leading to 
enhanced gene expression. As an example, the coding sequence can be linked to a 
35 stronger promoter or the native promoter of the sequence can be structurally modified so 
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as to improve its promoter strength. The expression may also be enhanced by modifying 
any other regulating sequence operably linked to the coding sequence e.g. by mutation. An 
alternative approach is to modify the nucleotide sequence of the transcription factor- 
encoding sequence. 

5 

A number of different sequences have been found to enhance gene expression from within 
the transcriptional unit and these sequences can thus be used in conjunction with the 
transcription factor-encoding nucleotide sequences according to the invention to increase 
their expression in transgenic plants. Various intron sequences have been shown to 
10 enhance expression, particularly in monocotyledonous cells. Intron sequences have been 
routinely incorporated into plant transformation vectors, typically within the non-translated 
leader. 

A number of non-translated leader sequences derived from viruses are also known to 
15 enhance expression, and these are particularly effective in dicotyledonous cells. 

Specifically, leader sequences from Tobacco Mosaic Virus, Maize Chlorotic Mottle Virus 
and Alfalfa Mosaic Virus have been shown to be effective in enhancing expression. 

Where the expression of the at least one gene involved in the metabolite biosynthesis is 
20 reduced ("down-regulated"), the reduction is typically by at least 10%, at least 25% or at 
least 100%, relative t o a cell into which a nucleotide sequence coding for a transcription 
factor is not inserted or which is without modified expression of the already present 
nucleotide sequence coding for a transcription factor. It is also contemplated that the 
reduction of gene expression can be higher such as by a factor of at least 1 .5 including at 
25 least 2, at least 5 or at least 1 0. 

Down-regulation of the expression of the transcription factor-encoding sequence can be 
obtained in several ways. Thus, as an example the expression is reduced by using anti- 
sense technology or "sense regulation" ("co-suppression"). In using anti-sense genes or 

30 partial gene sequences to down-regulate gene expression, a nucleotide sequence is 

placed under the control of a promoter in "reverse orientation" such that transcription yields 
RNA which is complementary to normal mRNA transcribed from the sense strand of the 
target gene. An alternative approach is to use a copy of all or part of the target gene 
inserted in sense, that is the same, orientation as the target gene, to achieve reduction in 

35 expression of the target gene by co-suppression. 
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In another aspect of the invention there is provided a method of modulating the stress 
resistance of a cell, the method comprising inserting into the cell a nucleotide sequence 
coding for a transcription factor, the nucleotide sequence is operably linked to at least one 
5 expression regulating sequence, and/or modifying the expression of a nucleotide sequence 
coding for such a transcription factor already present in the cell, the transcription factor is 
capable of regulating the expression of at least one gene coding for a gene product 
involved in the biosynthesis of said metabolite or precursor, and subjecting the cell to 
conditions where the inserted nucleotide sequence coding for a transcription factor is 
1 0 expressed or the expression of the nucleotide sequence already present in the cell is 
modulated, the expression of said nucleotide sequence resulting in a modified 
responsiveness of the cell towards exogenous stress conditions. 

In the present context, the expression "exogenous stress conditions" should be understood 
15 as meaning any condition that results in that the growth and function of the cell is impaired 
relative to optimum conditions for growth and function. In relation to a plant, such stress 
conditions include attack of the plant by a pathogen, abiotic adverse environmental 
conditions like cold or heat and drought (osmotic stress), insufficient or excess supply of 
nutrient compounds, insufficient or excess levels of plant hormones, UV light, low light 
20 levels and suboptimal pH conditions. 

In accordance with the invention the cell having modulated stress resistance may be any of 
the eel! types as mentioned above in which at least one gene involved in the biosynthesis 
of a metabolite or a precursor for the metabolite is modulated as described above and to 
25 which there is referred. In particularly useful embodiments the cells having modulated 
stress resistance are plant cells of the above types and species including whole transgenic 
plants. 

In accordance with the present method of modulating stress resistance, the expression of 
30 at least one biosynthetic gene is modulated by modifying the expression of any type of 
transcription factors that has a target sequence in the cell. Thus, in one embodiment, the 
nucleotide sequence coding for a transcription factor is a sequence coding for a 
transcription factor comprising at least one AP2 domain, including at least two AP2 
domains or at least three AP2 domains of any of the above types and having any of the 
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above sequences including such sequences that are operably linked to a regulating 
nucleotide sequence with which it is not natively associated. 

In accordance with the invention the plant cell which is modified to have enhanced stress 
5 resistance may be modulated in its expression of any of the above metabolites or gene 
products involved in metabolite biosynthesis. 

Similar to what is described above in the context of the method of modulating in a cell the 
expression of one or more genes involved in the biosynthesis of a metabolite or precursor 

10 herefor, the present method includes that the expression in the cell of one or more genes 
involved in the biosynthesis of a metabolite or a precursor herefor is enhanced or reduced 
relative to a cell into which a nucleotide sequence coding for a transcription factor is not 
inserted or which is without modified expression of the already present nucleotide 
sequence coding for a transcription factor. Such an enhancement or reduction is achieved 

1 5 mutatis mutandis as it is described for the former method. 

In a further aspect, the invention provides a recombinant cell having, relative to its parent 
cell, enhanced or reduced biosynthesis of a metabolite or a precursor therefor, and/or 
enhanced or reduced expression of a gene product involved in metabolite production, the 
20 cell comprising a nucleotide sequence coding for a transcription factor other than a 
transcription factor having a bHLH-type or a MYB-type DNA-binding domain, said 



transcription factor is capable of regulating the expression of at least one gene coding for a 
gene product involved in the metabolite production, said sequence is inserted into the cell 
and/or its expression is modified by operably linking it to a regulating sequence with which 

25 it is not natively associated. In useful embodiments such a cell is a plant cell of any of the 
types and species as mentioned above and modified as also described above. In other 
useful embodiments such a celt is a cell wherein at least one pathway leading to the 
biosynthesis of at least one precursor for the metabolite can be stimulated in the cell in the 
presence of an inducing agent. Such inducing agents include chemical compounds (e.g. an 

30 elicitor, jasmonate and a hormone) and physical factors (e.g. light, temperature, pH and 
shear stress). 

In another aspect, the invention provides a method of producing a metabolite including a 
plant secondary metabolite, the method comprising providing a recombinant cell as 
35 described above, cultivating said cell under conditions where the nucleotide sequence 
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coding for the transcription factor regulating the expression of at least one gene coding for 
a gene product involved in the metabolite production is expressed, and recovering the 
metabolite. The cultivation of the recombinant cell of the invention is carried out using any 
conventional cultivation media and any conventional culturing methods for the particular 

5 type of cell. Such conventional culturing methods also includes methods where the cell is 
cultivated under conditions where at least one precursor for the metabolite is added in a 
form which can be assimilated by the cell. In useful embodiments such precursors may 
include compounds such as e.g. tryptophan, tyrosine, lysine, ornithine, nicotinic acid, 
anthranilic acid and acetate. Also included are culturing methods where the recombinant 

1 0 cell of the invention is further genetically modified in order to stimulate the production of a 
precursor for the metabolite, and methods wherein a compound which stimulates the 
production of a precursor for the metabolite is added to the cell in form which can be 
assimilated by the cell. 

1 5 In a particular aspect of the invention there is provided a method of constructing a 

recombinant cell as described above, the method comprising the steps of (i) identifying in a 
source cell at least one nucleotide sequence coding for a transcription factor other than a 
transcription factor having a bHLH-fype or a MYB-type DMA-binding domain, said 
transcription factor(s) is/are capable of regulating in the source cell the expression of at 

20 least one gene coding for a gene product involved in biosynthesis of a metabolite or a 
precursor therefor, (ii) isolating said nucleotide sequence and (iii) inserting said isolated 



nucleotide sequence(s) into a host cell comprising a gene coding ror a gene product 

involved in biosynthesis of a metabolite or a precursor therefor, the insertion of said 
nucleotide sequence(s) resulting in that the expression of the gene of the host cell is 
25 modulated relative to the expression level in a host cell not comprising the inserted 

nucleotide sequence^). In useful embodiments, at least two isolated nucleotide sequences 
are inserted into the host cell. 

A typical non-limiting example of how the method is carried out is described in the below 
30 examples. 

In further aspects, the invention provides an isolated nucleic acid molecule comprising a 
nucleotide sequence coding for an amino acid sequence selected from the group 
consisting of SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7 including such an isolated 
35 nucleic acid that comprises a nucleotide sequence which is selected from the group 
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consisting of SEQ ID NO:1 , SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4 and a mutant, 
allele or variant hereof and a polypeptide which includes an amino acid sequence having at 
least 40% homology including at least 50%, such as at least 60%, at least 70%, at least 
80% and at least 90% to an amino acid sequence selected from the group consisting of 
5 SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7. 

There is also provided a polypeptide having the amino acid sequence shown in SEQ ID 
NO:5, SEQ ID NO:6 or SEQ ID NO:7. 

10 The invention also relates to a vector comprising a nucleic acid molecule as defined above 
including such as a vector that comprises at least one further sequence coding for a 
transcription factor and to a host cell comprising such a vector. The host cell is selected 
from the group consisting of a bacterial cell, a fungal cell, a yeast cell, an animal cell, a 
unicellular eukaryotic cell, an algae cell and a plant cell. 

15 

As mentioned above, the invention also provides a method of isolating a nucleotide 
sequence coding for a transcription factor not having a bHLH-type or a MYB-type DNA- 
binding domain. A typical example of a probe that is useful in such a method is a fragment 
of a nucleic acid as shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 or SEQ ID NO:4 
20 and a mutant, allele or variant hereof. 

The inventi on a l so provid es-a-meth o d of iso l ati n g a nuc le otid e sequence cod i ng for a 

transcription factor not having a bHLH-type or a MYB-type DNA-binding domain by 
using a nucleotide sequence as shown in SEQ ID NO:1, 2, 3 and 4 in a polymerase chain 
25 reaction. 

In other aspects, there are provided an isolated nucleic acid having the nucleotide 
sequence as shown in SEQ ID NO: 8 and a mutant, allele or variant hereof, and a method 
using this nucleic acid sequence for isolating a nucleotide sequence coding for a 
30 transcription factor not having a bHLH-type or a MYB-type DNA-binding domain. 

The invention will now be further illustrated by the following non-limiting examples and the 
drawings wherein 
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Fig.1 shows that the RV region of the Sfr1 BH promoter is required for elicitor and 
jasmonate responsiveness. Two transformed cell lines for each construct were 
subjected to a control treatment (C), or induced with purified elicitor (PE) or methyl- 
jasmonate (MJ). Constructs were introduced in C. roseus cell lines MP183L and BIX 
5 via particle bombardment and Agrobacterium, respectively; 

Fig. 2 illustrates that the RV region of the Sfr1 promoter is an autonomous elicitor and 
jasmonate responsive element. Two transformed cell lines for each construct were 
given a control treatment (C) or induced with purified elicitor (PE), or methyl- 
10 jasmonate (MJ). Constructs were introduced in C. roseus cell line MP183L via particle 
bombardment; 

Fig. 3 shows that mutations M2, 3, 4, and 5 in the RV region abolish elicitor and 
jasmonate responsiveness of the Sfrl BH promoter. Two transformed cell lines for 

15 each construct were given a control treatment (C) or induced with methyl-jasmonate 
(MJ). Constructs were introduced in C. roseus cell line BIX via Agrobacterium; 
Fig. 4 shows that mutations M3, 4, and 5 in the RV region abolish elicitor and 
jasmonate responsiveness of the Sf/-1 BH promoter and that mutant M2 weakens the 
responsiveness. Two transformed cell lines for each construct were subjected to a 

20 control treatment (C) or induced with purified elicitor (PE) or methyl-jasmonate (MJ). 



Constructs were introduced in C. roseus cell line MP183L via particle bombardment; 

Fig. 5 shows Orca-J and Orca-2 expression patterns in elicited C. roseus suspension 
cultures. C. roseus cells were treated with 50 //M methyl-jasmonate (MJ), 0.5% crude 
25 yeast extract elicitor (YE) or with an equivalent amount of partially purified elicitor 
(PE). Cells were harvested after the time points indicated in the figure. RNA was 
extracted and Northern blots were probed with DNA sequences specific for Orca-1 or 
Orca-2; 



30 Fig. 6 illustrates that T-DNA activation tagging of the Orca-3 gene activates 
expression of the TIA biosynthetic genes tryptophan decarboxylase (TDC), 
strictosidine synthase (STR) and strictosidine glucosidase (SG). Northern blot analysis 
of RNA extracted from line 46 and a 4-mT-resistant control cell line (C). Blots were 
probed with Tdc, Sfr1 and Sg (6A). RNAs in figure 6B were hybridised with the 
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rescued flanking plant DNA from line 46, containing the complete Orca-3 open reacting 
frame; 

Fig. 7 shows Orca3 expression patterns in elicited C. roseus suspension cultures. C. 
5 roseus cells were treated with 50 //M methyl-jasmonate <MJ), 0.5% crude yeast 

extract elicitor (YE) or with an equivalent amount of partially purified elicitor (PE). Cells 
were harvested after the time points indicated in the figure. RNA was extracted and 
Northern blots were probed with the rescued flanking plant DNA from line 46, 
containing the complete Orca-3 open reading frame; 

10 

Fig. 8 illustrates that ORCA-1 and 2 are transcriptional activators of the Strl -promoter 
in C. roseus cells. GUS/CAT ratios with standard deviation were calculated from 
duplicate enzyme activity measurements of three bombarded samples arising from one 
batch of coated particles in a representative bombardment experiment. Tungsten 
15 particles were coated with a mixture of plasmids containing BH-GUS, empty 

pMOG184, or pMOG184 carrying Orca-1 or Orca~2 in sense or anti-sense orientation 
with respect to the CaMV 35S promoter. To correct for transformation efficiency, a 
plasmid carrying CaMV35S-CAT was added-to all coatings,^ 

20 Fig.9 illustrates that ORCA-1 and 2 transcriptionally activate the Sth -promoter in C. roseus 
cells in a sequence-specific manner. GUS/CAT ratios with standard deviation were 
calculated from duplicate enzyme activity measurements of three bombarded samples 
arising from one batch of coated particles in a representative bombardment experiment. 
Tungsten particles were coated with a mixture of GUS plasmids and pMOG184 derivatives. 

25 The GUS gene was driven by the Strt BH promoter or BH mutant derivatives M1 through 
M6. pMOG184 plasmids were either empty (pM), or carried Orca^ (01) or Orca-2 (02) in 
sense orientation with respect to the CaMV 35S promoter. To correct for transformation 
efficiency, a plasmid carrying CaMV35S-CAT was added to all coatings; 

30 Fig. 10 shows that ORCA-3 activates expression of the SfrlNH promoter {-202 to -1 
with respect to ATG start codon). C. roseus cells were bombarded with tungsten 
particles coated with a mixture of plasmids containing Str\ NH-GUS, empty 
pMOG184, or pMOG184 carrying Orca-3 in sense orientation with respect to the 
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CaMV 35S promoter, and stained for GUS enzyme activity by addition of the 
substrate X-Gluc; 

Fig. 1 1 shows that ORCA-3 activates expression of the Tdc -219 promoter (-219 to 
5 +86 with respect to ATG start codon). C. roseus cells were bombarded with tungsten 
particles coated with a mixture of plasmids containing Tdc -219-GUS, empty 
pMOG184, or pMOG184 carrying ORCA-3 in sense orientation with respect to the 
CaMV 35S promoter, and stained for GUS enzyme activity by addition of the 
substrate X-Gluc; and 

10 

Fig. 1 2 shows that ORCA-1 , -2 and -3 activate expression of the jasmonate- 
responsive RV element from the Srr1 -promoter. C. roseus cells were bombarded with 
tungsten particles coated with a mixture of plasmids containing 4RV-GUS, empty 
pMOG184, or pMOG184 carrying 0rca-1, -2 or -3 in sense orientation with respect to 
15 the CaMV 35S promoter, and stained for GUS enzyme activity by addition of the 
substrate X-Gluc. 



EXAMPLES 



20 



The techniques applied in the below examples are all standard recombinant DMA and- 



molecular cloning techniques well known in the art and are e.g. described by Sambrook et 
al. (1989) Molecular cloning: A laboratory manual, second edition, Cold Spring Harbor 
Laboratory Press and Ausubel et al. (1987) Current protocols in molecular biology, Greene 
25 Publishing Associates and Wiley Interscience, New York. 
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EXAMPLE 1 

1.1 Identification of an elicitor- and iasmonate-responsive elem ent in the Sf/i promoter 

The RV region of the promoter of the strl gene (DDBJ/EMBL/GenBank database 
accession number Y10182) of Catharanthus roseus (Madagascar periwinkle) coding for 
strictosidine synthase was, as described below, via promoter analysis including detailed 
scanning mutagenesis, identified as an elicitor- and jasmonate-responsive element. 



22209DK1 1/FLV/HS/06-02-99 




27 

The Bglll/Hincll fragment (BH) of the Strt promoter and a BH fragment in which the 
Rsal/Avall fragment (RV) was deleted (BH" RV ) were introduced as Bglll/Sall fragments into 
the GusSH vector consisting of pBluescript SK+ carrying the (^-glucuronidase reporter 
5 gene. These Sf/i promoter derivatives fused to the gusA gene were introduced into two 
different C. roseus cell lines. Cell line MP183L was transformed via particle bombardment, 
whereas cell line BIX was transformed via an improved Agrobacterium strain (Memelink, 
unpublished results, see example 4.1). For Agrobacterium-medlaied transformation, the 
GusSH derivatives were introduced into the binary vector pMOG22lambdaCAT, a 

10 derivative of pMOGIambdaCAT containing the hygromycin resistance gene instead of the 
kanamycin resistance gene as a plant selectable marker. Northern blot analysis was 
performed for two independently transformed cell lines for each construct, each estimated 
to consist of a population of at least 100 to 1000 independent transformants. The BH 
fragment of the Sfrt promoter conferred low basal level expression onto the gusA gene in 

15 the control (0.05% dimethylsuifoxide (DMSO)) treated cells (Fig.1). Incubation of MP1 83L 
cells transformed with BHGusSH with partially purified yeast extract elicitor (PE; Menke et 
al., 1 999) and methyl-jasmonate (MJ) for 6 h induced the expression of the gusA gene 
several fold (Fig. 1 upper panel). These conditions similarly induced the endogenous Sfri 
gene as described previously (Menke et al.,1999). The BH^ fragment did not confer 

20 elicitor- and MJ-responsive gene expression onto the gusA reporter gene in cell line 

MP183L (Fig.1 upper panel). Similar results were obtained with BIX cell lines transformed 
with these Sf/1 -promoter derivatives fused to gusA. BIX cells carrying the BH-gusA fusion 
had low basal gusA mRNA levels, but incubation for 6 h with MJ induced high gusA gene 
expression (Fig.1 lower panel). BIX cells carrying BH-RV fused to gusA did not have 

25 detectable basal expression levels and could not be induced by MJ (Fig.1 lower panel). 
This loss of function experiment shows that the RV region of the promoter is required 
for elicitor- and jasmonate-responsive gene expression. 

In a gain of function experiment it was tested whether the RV fragment of the Strt promoter 
30 was able to confer elicitor-and JA-responsive gene expression onto a minimal promoter- 
gusA fusion. To this end tetramers of the RV region were constructed and fused to a 
Caulifower Mosaic Virus 35S promoter truncated to -47 and fused to gusA in the vector 
GusSH-47. The RV region was cloned as a Rsal/Avall fragment into the Smal/EcoRV sites 
of plC20H, and tetramerized using the enzymes BamHI/Bglll. The tetramer was cloned as 
35 a BamHI/Bglll fragment into the BamHI site of GusSH-47. C. roseus cell line MP183L was 
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transformed with this construct via particle bombardment. Control cell lines were generated 
by introducing GusSH-47 or 6Tcyt-GusSH-*7 in two independently transformed cell lines 
for each construct. The 6Tcyt construct contains 6 head-to-tail copies of the cyt-1 cis-acting 
element from the T-DNA T-CYT gene which is not elicitor- or jasmonate-responsive. 

5 Northern blot analysis was performed for two independently transformed cell lines, each 
estimated to consist of a population of at least 100 to 1000 independent transformants. 
Incubation of the 4RV-47 cell lines with PE or M J for 6 h resulted in induction of gusA gene 
expression as compared to the expression of the control (0.05% DMSO) incubated cells 
(Fig.2). The same treatment of cell lines carrying the -47 construct did not result in 

10 measurable gusA mRNA levels in the control or induced samples (Fig.2). In the cell lines 
carrying the 6Tcyt-GusSH-47 construct, high basal levels of gusA mRNA were found 
(Fig.2). Treatment of these cell lines with PE or MJ for 6 h did not induce expression 
(Fig.2). These results show that the RV region is sufficient to confer elicitor- and 
jasmonate-responsive gene expression onto an otherwise inactive minimal promoter-gusA 

15 construct. The results with the 6Tcyt lines show that the RV region does not potentiate a 
silent elicitor-responsive element within the CaMV -47 region or elsewhere on the 
construct. 



Detailed block scanning mutagenesis of the RV region in the context of the BH fragment of 
20 the Strt promoter was performed to pinpoint the exact location of the elicitor- and 

jasmonate-responsive-sequence within the RV fragment Mutations were introduced by 
PCR with primers having a 6 bp mismatch combined with Ml3-reverse pnmer using the 
Xhol/Hincll fragment of the SM promoter cloned in plC20H as a template. The primers 
RVM1-RVM8 are disclosed in the below Table 1.1 



25 



Table 1.1 Primers used in detailed block scannin g mutagenesis of the RV region 



RVM1 



RVM2 



RVM3 



RVM4 



RVM5 



RVM6 



RVM7 



RVM8 



5-CCACGTGGTTGTAGTCTCTTAGACC-3' 



■V^nTACATCA GAGAATGACCGCCTTC^" 

... — ■ I'SS A A A /-> Ol 



5 -CACTCTTACTGGCGCTTCTTTGAAAG-3' 
5'-AGACCGCGAAGAATGAAAGTG-3' " 



5' -AGACCGCCTTCTTACTTTCTGATTTCCCC-3' 
5'-TTGAAAGACTAAACCCTTGGAC-3' 



«V.nAAAGTGAI I IGGOAACGACCTTG^y 

^ i ■ ' I ^-n. j^-r^s A S~\ Ol 



S'-CCCTTGCTGGAAG I I I GGTGAG-3 7 



PCR fragments obtained were used as primers in PCR reactions together with M13- 
forward primer on the Xhol/Hincll fragment of the Sfrl promoter cloned in plC20H. PCR 
30 fragments obtained were reamplified using M1 3-reverse and M13-forward primers. 
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Amplified PCR products were digested with Bglll/Sall and cloned into GusSH digested with 
BamHI/Sail. In total 8 mutated versions of the RV fragment were made, covering the entire 
fragment from the Rsal site to the Avail site. In each mutant 6 adjacent nucleotides (nt) 
were changed into their complementary nt, i.e. changing A to T and G to C. The 8 mutated 
5 versions of the BH fragment fused to gusA were introduced into C. roseus cell lines 
MP183L and BIX as described above. Northern blot analysis was performed for two 
independently transformed cell lines for each mutant construct, that were incubated for 6 h 
with DMSO or MJ for C. roseus BIX cell lines (Fig.3) and with DMSO, PE or MJ for C. 
roseus MP183L cell lines (Fig.4). By comparing basal and MJ-induced gusA gene 

10 expression for ail the mutant BIX lines to the expression conferred by wildtype BH fragment 
it is clear that mutations 1, 6, 7 and 8 conferred a wildtype gene expression pattern (Fig.3). 
In mutant 2 the expression was only minimally inducible. In mutants 3, 4 and 5 expression 
was uninducible, and similar to the expression conferred by the BH-RV fragment (Fig.3). 
The experiment with the MP183L lines carrying the mutated BH fragment gave similar 

1 5 results (Fig.4). Mutants 1 ,2 and 6 showed expression patterns similar to the wildtype BH 
fragment. Mutants 3, 4 and 5 are not consistently inducible by PE or MJ and gave similar 
expression patterns to the BH-RV fragment. 

In another experimental series, the effect of the block mutations was studied in constructs 
20 consisting of tetramers of RV derivatives fused to GusSH-47. Tetramers of RV mutants M2 t 
M3 ? M4, M5 and M6 were constructed as described above for RV wildtype fragment, and 
fused to a Caulifower Mosaic Virus 35iS promoter truncated to -47 fused to gusA in the 
vector GusSH-47. The constructs were introduced in C. roseus cell line MP183L via 
particle bombardment. Two independent transformed cell lines were generated for each 
25 construct. Incubation of the cell lines carrying RV wildtype or mutant M6 tetramer 

constructs with PE or M J for 6 hours resulted in induction of gusA gene expression as 
compared to the expression of the control (0.05% DMSO) incubated cells (results not 
shown). The same treatment of cell lines carrying the mutant M2, M3, M4 or M5 tetramer 
constructs did not result in measurable gusA mRNA levels in the control or induced 
30 samples (results not shown). 

These results show that the elicitor- and MJ-responsive element is contained within the 24 
bp sequence that is covered by mutations 2 to 5, with mutations 3, 4 and 5 having the most 
dramatic negative effect on the inducibility of the BH or the RV fragment of the Sfrl 
35 promoter. 
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EXAMPLE 2 

r.innin g of ORCA-1 and 2 

5 

The RV region was, as described in Example 1. identified as an elicitor- and jasmonate- 
responsive element in the Sfrl promoter. Use of the RV element as bait in a one-hybrid 
transcription factor screening resulted in the isolation of 17 cDNA clones that corresponded 
to two different mRNA species, encoding proteins belonging to the APETALA2 domain 
10 class of transcription factors. They were named ORCA-1 and 2 (for Octadecanoid- 
Responsive Catharanthus AP2). 

The RV region of the Sfrl promoter (DDBJ/EMBL/GenBank database accession number 
Y10182) was cloned as a Rsal/Avall fragment into the Smal/EcoRV sites of plC20H and 
15 tetramerized using the enzymes BamHI/Bglll. The tetramerwas cloned as a BamHI/Bglll 
fragment into the BamHI site of pHIS3NX. The tetramer-HIS3 gene fusion was transferred 
as a Notl/Xbal fragment to the yeast integration plasmid plNT1. 

The resulting plasmid was linearized with Ncol and introduced into yeast strain Y187. 
20 Recombinants were selected on YPD medium containing 150 ug/ml G418, and the 
occurrence of a single recombination event between the plNT1 derivative and the 



chromosomal PDC6 locus was verified via Southern blotting. Yeast strain Y187-4RV was 
then used in a one-hybrid screen for DNA-binding proteins with a cDNA library of 
Catharanthus roseus cloned in the plasmid pACTII. To construct the library, cDNA was 
25 synthesised with a Stratagene kit on 2 different poly(A) RNA preparations mixed in a 1 :1 
ratio isolated from cell suspensions that were elicited with 0.05 % yeast extract for 1 and 4 
h, respectively. The cDNA was cloned in the EcoRI/Xhol sites of lambdaACTII. The 
amplified lambda library, consisting of 3.5 x 10 6 independent primary transformants was 
converted to a pACTII plasmid library via Cre-lox-mediated in vivo excision in Cre- 
30 expressing in E. coli strain BNN132. Four million Y1 87-4RV transformants were screened 
in a one-hybrid screening. Plasmids isolated from 364 colonies obtained on SD medium 
without histidine and leucine were retransformed to Y187-4RV. and two Y187 derivatives 
carrying tetramers of RV mutant fragments RVM4 and RVM5 fused to the HIS3 gene. 
Seventeen plasmids showed strongly reduced growth in Y187-4RVM4 compared to Y187- 
35 4RV and Y187-4RVM5, whereas all other plasmids did not give rise to significant 
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differences in growth between the three yeast strains. Partial sequencing of these plasmids 
revealed that they belonged to one of only two classes. The ORCA-1 class consisted of 3 
plasmids carrying cDNA sequences with identity to the sequences in SEQ ID Nos 1 and 2 . 
SEQ ID No:1 is derived from plasmid RV23, whereas SEQ ID No:2 is derived from plasmid 
5 RV124. Both sequences are identical throughout the main open reading frame, but diverge 
at their 3* ends. Obviously, they are derived from the same gene, and the differences could 
have arisen by alternative splicing of the primary transcript. They contain the complete 
coding region, since SEQ ID Nos. 1 and 2 contain an in frame stop codon preceding the 
first start codon. The ORCA-2 class consists of 14 plasmids carrying cDNA sequences with 

10 identity to the sequence in SEQ ID No:3. All ORCA-2 cDNA sequences appeared to be 
partial. The missing portion was isolated via PCR with the primers 4TH (5*- 
CCCCACCAAACCCAAAAAAAG-3') and RV117 (S'-CCATATCCTCGATCCTTTTCTC-S 1 ) 
using the pACTII cDNA library as a template. A prominent 0.6 kbp PCR band was digested 
with EcoRI and BamHI, and cloned in pBluescript II SK+. A complete clone was 

15 constructed in pBluescript II SK+ by fusion of the PCR fragment with the cDNA fragment 
from plasmid RV210 using the unique BamHI site. 

EXAMPLE 3 
20 Expression of ORCA-1 and -2 



To analyse the expression of the Orca-1 and Orca-2 genes suspension-cuiturea u. roseus 
cells were incubated with M J (50 pM) or PE (0.05% YE equivalent). After incubation 
periods ranging from 1 to 28 h the cells were harvested and immediately frozen in liquid 

25 nitrogen. Total RNA was isolated and Northern blots were probed with a 700 bp EcoRI 
fragment from SEQ ID NO:1 , which is specific for Orca-1 , or a 450 bp EcoRI/Kpnl fragment 
from SEQ ID NO:3 f specific for Orca-2. For Orca-1 the control sample, incubated with 
DMSO for 6 h, had high levels of Orca-1 mRNA, but even higher levels were induced by 
MJ after 4 h (Fig.5). MJ-induced gene expression was clear for Orca-2 with high levels of 

30 expression at 2 h as compared to the control. The level of mRNA remained high up to 28 h 
(Fig.5). Both Orca-1 and Orca-2 were rapidly induced by PE, with an increase of mRNA 
levels within 1 h for Orca-2 and within 2 h for Orca-1 (Fig.5). The induction of Orca-1 and 
Orca-2 gene expression peaked at 2 to 3 h after which the amount of both mRNAs 
declined. The same induction patterns were found for crude yeast extract (YE) (Fig.5). 
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These results show that both Orca-1 and Orca-2 are inducible by MJ and PE, with Orca-2 
being induced most rapidly. 

EXAMPLE 4 
5 Cloning of ORCA-3 

T-DNA activation tagging has been used as a reverse genetic method to isolate genes 
involved in plant growth and development. Genes involved in responses to ABA and 
cytokinin have been identified using this technique. A T-DNA carrying a strong 
10 transcriptional enhancer is integrated into the plant genome by Agrobacterium 

tumefaciens-med\B\ed transformation. Expression of genes adjacent to the integration site 
will become deregulated, which can result in a dominant mutation. 

To isolate dominant mutants with increased expression of TIA (terpenoid indole alkaloid) 
15 biosynthetic genes, the Tdc gene was used as selectable marker. TDC is involved in the 
conversion of L-tryptophan into tryptamine, one of the first steps in TIA biosynthesis. TDC 
is able to use substrates other than L-tryptophan, like 4-methyltryptophan (4-mT). This 
compound is toxic to plant cells, and is converted by TDC in to the non-toxic 4- 
methyltryptamine. We used 4-mT to select for C. roseus cell lines in which the activation 
20 tag causes an increase in Tdc expression. The selection for cell lines with increased Tdc 
expression is described below 



4.1 Plasmids and Agrobacterium tum efaciens strain 

25 Plant cell transformations were done by applying the ternary vector system (van der Fits e 
a!., manuscript in preparation). A Kpnl fragment carrying a constitutive virG mutant gene 
was cloned onto plasmid pBBR1 MCS and this plasmid was used as ternary vector 

A brief outline of the cloning of the binary vector Tag-2B4A1 follows: The 35S promoter 
30 derivative 2B4A1 , containing multiple copies of the B and A1 enhancers from the CaMV 
35S promoter, was cloned on a pUC21 plasmid containing the Hpt gene driven by a 
deletion derivative of the FMV 34S promoter ( 34S-Hpf) and the gusA reporter gene. This 
complete plasmid (including pUC21 sequences) was digested at a unique Bglll site and 
cloned into the binary vector pSDM14 cut with BamHI. The 35S promoter is now reading 
35 towards the right border sequence. 
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Both plasmids (binary and ternary vectors) were introduced into Agrobacterium 
tumefaciens strain LBA4404 via triparental mating using pRK2013 as a helper strain. 

5 4.2 Transformation of Catharanthus roseus suspension cultured ce lls 

Agrobacterium tumefaciens strains were grown for 3 days at 28°C on solid AB medium 
containing 20 pg/ml rifampicin, 100 ug/ml kanamycin and 75 ng/ml chloramphenicol. From 
these bacteria a liquid overnight culture was grown in the same medium. Cells were 
10 harvested by centrifugation and resuspended at a OD 60 o of 1 in LS medium of pH 5.2 
supplemented with 2 mg/l NAA, 0.2 mg/l kinetin, 3% sucrose. 1% glucose and 100 uM 
acetosyringone (hereafter called cocultivation medium). 

C. roseus cells (cell line BIX) are grown in LS medium supplemented with 3% sucrose, 0.2 
15 mg/l kinetin and 2 mg/l NAA (hereafter called LS13) at 25°C (16/8 h light/dark cycle) and 
subcultured every 7 days by transferring 7.5 ml of cells in 50 ml medium. For 
transformation 7 days old cells were used. The culture medium was replaced by 
cocultivatiofHnedium and the cell density was adjusted to 50% sedimentation volume. On 
a petridish (diam. 80 mm) containing 25 ml of cocultivation medium solidified with 0.7% 
20 plant tissue culture agar, 7.5 ml of this-C. roseus suspension was mixed with 750 f-tl 

Agrobacterium t umvf u uiuns cu lt u re . Dishes were tap o d using l e ucopor tape and put a t 

25°C in the dark for 3 days. After the cocultivation the cells were washed and plated on a 
paper filter disc (diam. 75 mm, Whatmann, No.4). These filters were placed in petridishes 
containing 25 ml LS13 with 50 ug/ml hygromycin-B, 400 pg/ml cefotaxim, 100 pg/ml 
25 vancomycin and 0.4 mM 4-methyl-tryptophan. Filters were transferred onto fresh medium 
weekly. 

4.3 Isolation of a 4-mT resistant C. roseus line with increased TIA biosvnthetic ge ne 
expression 

30 

An estimated number of 400.000 to 500.000 stable, independent C. roseus transformants 
were generated by Agrobacterium tumefaciens-mediated transformation with the tagging 
construct Tag-2B4A1 . Transformants were placed on selective medium containing 50 
jag/ml hygromycin-B, 400 pg/ml cefotaxim, 100 \tgfm\ vancomycin and 0.4 mM 4-mT. After 
35 several weeks of selection 281 independent 4-mT-resistant calli were obtained. These calli 
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were transferred into liquid medium containing 50 ^g/ml hygromycin-B and 0.2 mM 4-mT. 
From these suspension cultures RNA was extracted. Northern blot analysis showed that 
several mutants had an increased Tdc mRNA level (data not shown), which in all likelihood 
is causing the 4-mT resistance. One these mutants (line 46) is described in more detail 
5 below. 

For line 46 Northern blot analysis was performed using other genes from the TIA 
biosynthetic pathway. RNA extraction and Northern blot analysis were performed. Ten ug 
RNA samples were loaded onto the gels. As 32P-labeled probes, cDNAs encoding TDC, 
10 STR (Pasquali et al., 1992), and SG (Geerlings, Memelink, van der Heijden and Verpoorte, 
unpublished results) were used. 

Besides Tdc, also Strt and Sg mRNA levels were increased significantly in the tagged cell 
line 46, when compared to a 4-mT resistant control cell line (number 38) (Fig.6A). Thus in 
1 5 line 46, the T-DNA tag appears to have activated a central regulator of TIA biosynthetic 
gene expression, thereby inducing the expression of at least three genes, and possibly the 
complete TIA biosynthetic pathway. Chromosomal DNA was extracted from 50 gram fresh 
weight of cells from suspension line 46 and isolated by CsCI-Ethidium bromide 
ultracentrifigation. Southern blots were made and the gusA gene was used as 32P-labeled 
20 probe isolated from GusSH. Southern blot analysis of line 46 showed that only one T-DNA 
copy was integrated into the plant genome (data not shown). 

For plasmid rescue, 10 ^g genomic DNA of line 46 was digested with Xbal (a unique site in 
tagging construct Tag-2B4A1), iigated and transformed to electrocompetent cells of E.coli 
25 strain NM554. 

Plasmid analysis of the obtained E.coli transformants identified the rescued plasmid 46X8, 
which carried 1 .6 kb of plant DNA immediately flanking the right border of the tagging 
construct. Using the rescued sequence as a probe on a Northern blot of RNA isolated from 
30 line 46 and control line 38, one hybridising mRNA species was observed (Fig.6B). This 
transcript was absent in control cell line 38, whereas it was present at a high level in line 
46. 

Sequence analysis of plasmid 46X8 revealed an open reading frame (ORF), located at 
35 approximately 600 bp downstream of the start of the rescued plant sequence. Homology 
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searches in databases revealed one AP2 domain present in the predicted amino acid 
sequence of this ORF. 



10 



AP2 domain containing proteins are a class of transcription factors exclusively found in 
plants. The encoded protein was called ORCA-3 (for octadecanoid-responsive 
Catharanthus AP2 domain). Several AP2 domain transcription factors, all containing one 
conserved AP2 domain, have been shown to participate in stress signalling (Pti, EREBP, 
CBF1 ( DREB). CBF1 has been demonstrated to act as central regulator of cold-induced 
gene expression in Arabidopsis thaliana. In analogy, the tagged gene described here, 
appears to encode a central regulator of TIA biosynthetic gene expression. 



The sequence rescued on plasmid 46X8 is of genomic origin, and may be interrupted by 
intron sequences. To isolate a cDNA copy of the mRNA transcribed from the tagged gene, 
a PCR approach was followed. Primer sequences were based on DNA sequence 

1 5 information from the rescued plasmid. Primer OR1 (5'- 

GAATTCATATGGCGGAAAGCTGTCAGGAGGATTC) was combined with the vector T3 
primer and OR4 (5-CGACGTCGTAGAAGGCTCCGCAGGG) with the, vector M 13-40 
primer in PCR reactions using lambda DNA prepared from a C. roseus root cDNA library 
(Pasquali et al., 1992) as a template to isolate the 3' respectively 5' parts of the ORCA3 

20 cDNA. Based on DNA sequence data from these partial cDNA clones, primers OR5 (5* 
AGATCTCATAGTTCCGAAGAAATC-ATTTCCGTCTCAG) and OR6 (5* 
AGACTCGTGAACTTTTTTGGATATAAAATTTTGTAC-ATTCC) were developed, which 
were used in a PCR on the rescued 46X8 plasmid to obtain the complete ORCA-3 open 
reading frame. The cDNA and amino acid sequences are provided in SEQ ID NO;4 and 

25 SEQ ID NO:7, respectively. Homology with other AP2 domain proteins, including the 
Catharanthus proteins ORCA-1 and ORCA-2 described above, is restricted to the AP2 
domain. Sequences outside this domain are not conserved. 



30 EXAMPLES 

The tagged ORCA-3 gene is induced bv elicitor and me thvHasmonate 

Previous research showed that the octadecanoid pathway, leading to the synthesis of 
jasmonate, is involved in elicitor-induced expression of TIA biosynthetic genes (Menke et 
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al., 1999). Treatment of C. roseus cells with methyl-jasmonate (MJ) thus resulted in a 
transient increase of both Tdc and Strt mRNA levels. 

The T-DNA tagging experiment in Example 4.3 showed that the Tdc, Strt and Sg genes 
5 are regulated by ORCA-3. Based on the fact that two other Catharanthus AP2 domain 
proteins. ORCA-1 and 2, are involved in the elicitor- and jasmonate-responsive expression 
of the Strt gene (Example 2), it was speculated that ORCA-3 may be a third elicitor- and 
jasmonate-responsive transcription factor. 

10 Northern blots were loaded with RNA from C. roseus cells treated for various time points 
with MJ. crude yeast extract elicitor (YE) and with a partly purified elicitor from yeast extract 
(PE). Subsequently, blots were probed with plasmid 46X8. containing the tagged gene. 
Treatment of cells with MJ resulted in a rapid increase in expression of the tagged gene 
(Fig.7). This increase was already observed within 2 h after addition of MJ and sustained 

15 for at least 28 h. The induction of Tdc and Strt expression was slower (maximal at 4 h after 
treatment) and was also maintained for at least 28 h (Menke et al.. 1999). Because of the 
induction by a product from the octadecanoid pathway, the tagged gene was named Orca- 
3 (Octadecanoid Responsive Catharanthus AP2-domain protein). Incubation of cells with 
crude (YE) and partly purified (PE) elicitor (Fig.7) resulted in a rapid and transient increase 

20 of Orca-3 mRNA levels, although the expression level is low relative to MJ-induced levels. 
-H,o inrinrtion of gene expression is at approxim ately 2 h after addition of the 



elicitor. The maximal induction of Tdc and Strt expression by the elicitors occurs later 
(maximal induction after 6-8 h; Menke etal.. 1999). 

25 The Northern blot analysis shows that Orca-3 expression is elicitor- and jasmonate- 
responsive and precedes the induction of the TIA biosynthetic genes, indicating that 
ORCA-3 is an elicitor- and jasmonate-responsive transcription factor that coordinated 
controls TIA biosynthetic gene expression. 



30 



EXAMPLE 6 

Rinriino site of ORCA-1 and 2 with in the RV region of Str promoter 

To identify the ORCA binding site within the RV region, Y187 yeast derivatives that 
35 contained tetramers of the RV wildtype fragment and mutants M1 through M6 (see below 
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and Table 1.1 for mutations) fused to the HIS3 gene were constructed via single cross-over 
in the PDC6 locus as described above. 

The sequence of the RV region (SEQ ID NO:8) of the Str\ promoter is shown below. 
5 Mutated blocks are underlined or indicated in bold face. Nucleotides within each block were 
changed into their complementary nucleotides. 

M2 M4 M6 

GTACATCACTCTTA GACCGC CTTCT TTGAAAG TGATT TCCCTTG GACC 
10 Ml M3 M5 M7 

The yeast strains were then separately transformed with pACTII derivatives RV124 
(encoding ORCA-1 fused to the GAL4 activation domain) and RV210 (encoding part of 
ORCA-2 including the AP2 domain fused to the GAL4 activation domain). Transformants 

15 were plated on SD minimal medium without leucine but containing histidine. Colonies 

obtained after 5 days were streaked on SD plates without histidine and leucine, and growth 
was recorded after 5 days (Table 6.1). Growth indicates that ORCA proteins bound the RV 
derivative and activated HIS3 gene transcription, whereas lack of growth indicates that 
ORCA proteins failed to bind the RV derivative. The results indicate that ORCA-1 and 2 

20 failed to confer growth in yeast containing mutants M3 and M4 fused to the HIS3 gene, in 
these mutants, blocks of 6 nucleotides are changed in their complementary n ucleotides 
within the sequence GACCGCCTTCTT. This sequence is similar to the binding site 
identified for tobacco EREBP proteins (Ohme-Takagi and Shinshi, 1995) and tomato Pti 
proteins (Zhou et aL, 1997) within PR gene promoters and for Arabidopsis CBF and DREB 

25 proteins within COR gene promoters (Stockinger et aL, 1997). 
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TABLE 6.1 

Growth of veast Y187 derivatives containing different RV mu tants fused to the HIS3 gene 
transformed with ORCA-1 or ORCA-2 expressing pACTII plasmids. 



RV derivative 


ORCA-1 (RV124) 


ORCA-2 (RV210) 


RV 


+ 


+ 


M2 


+ 


+ 


M3 






M4 






M5 


+ 




M6 


+ 


+ 



5 



EXAMPLE 7 

ORCA-1 and 2 are transcriptional activators of the Sfrl -promo ter in C. roseus cells 

1 0 To investigate whether ORCA-1 and 2 can act as transcriptional activators interacting with 
the Sfrt -promoter, C. roseus cells were transiently cotransformed with a Sfrl-promoter- 
gusA fusion (BH-GusSH) construct , an overexpression vector (pMOG184) carrying Orcal 
or Orca2 cDNAs fused in sense or antisense orientation to the CaMV 35S promoter and a 
overexpression vector carrying the chloramphenica! acetyl transferase gene (car). As a 

1 5 control, cotransformation of BH-GusSH with an empty overexpression vector was done. 
The CaMV 35S-cat construct served as an internal control for transformation efficiency. 

C ell s w e re transform e d throug h partic l e bomb a r d m e nt as de s cri b ed before , using the three 

constructs in a ratio of 1:1:3 (gus:cat:orca). Twenty four hours after transformation, cells 
were harvested and frozen in liquid nitrogen. GUS and CAT activity assays were performed 

20 and the Gus reporter gene expression was expressed as a GUS/CAT ratio to correct for 
transformation efficiency. Expression of Orca-1 in sense orientation resulted in a higher 
GUS activity (gus/cat ratio, 1 .6 fold induction) as compared to the control (BHGusSH 
+pMOG 184) (Fig. 8). Expression of Orca-1 in antisense orientation did not alter the GUS 
activity level (Fig. 8). Expression of Orca-2 in sense orientation resulted in a 3.3 fold 

25 induction of GUS activity, whereas expression of Orca-2 in antisense orientation had little 
or no effect on GUS activity as compared to the control transformed cells (BHGusSH 
+pMOG 184) (Fig. 8).These results show that transient overexpression of either ORCA-1 or 
2 (in sense orientation) enhances the expression of the gusA reporter gene driven by the 
Str-promoter fragment BH. 
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To show that ORCA-1 and 2 enhance the expression driven by the Strt -promoter in a 
sequence-specific manner, co-transformations were done using wild-type and mutant 
versions of the SM -promoter (fused to gusA). ORCA-1 enhanced the expression of the 
gus reporter gene to some degree through interaction with wild-type and mutant versions 1, 
5 2, 5 and 6 of the BH fragment of the Strt -promoter, as compared to the control pMOG184) 
(Fig. 9). ORCA-1 did not enhance the expression of reporter constructs driven by mutant 
versions 3 and 4 of the Sf/i -promoter fragment BH (Fig. 9). ORCA-2 strongly enhanced the 
expression of gus reporter gene through interaction with wild type and mutant version 1, 
and 6 of the BH fragment of the Sfvi-promoter (Fig. 9), as compared to the control 

10 (pMOG184). ORCA-2 enhanced gus gene expression weakly for mutants 2 and 5 (Fig. 9). 
ORCA-2 did not enhance the expression of reporter constructs driven by mutant versions 3 
and 4 of the Sfr-promoter fragment BH (Fig. 9). These results show that the activation of 
gene expression by ORCA-1 and 2 is sequence-specific and occurs through direct 
interaction with the Sf/1 -promoter fragment RV. The sequence requirement for ORCA 

15 activation in plant cells is slightly different from that in yeast cells (see example 6). In yeast, 
only mutations M3 and M4 affect the ability of ORCA's to activate gene expression, 
whereas in plant ceils mutations M2 and M5 additionally affect gene expression to some 
degree. These differences may reflect differences in binding in the two celt types, for 
instance due to interaction with different endogenous proteins. On the other hand, the 

20 transient co-bombardment data closely correlate with the results of the stable 

transformation (Example 1), indicating that indeed lack of ORCA binding is the reason for 
lack of expression of mutants M2, 3, 4 and 5 in stably transformed lines. 



25 EXAMPLE 8 

ORCA-3 is a transcriptional activator of the Tdc and Strt promoters in C. roseus ce lls 

To investigate whether ORCA-3 activates expression from the Tdc- and Sf/i -promoters, a 
particle bombardment assay was performed. C. roseus cells were co-bombarded with the 
30 gwsy4-reporter gene with or without an overexpression vector containing the Orca-3 cDNA 
in the sense orientation. The gusA-reportergene was driven by a fragment of the Tdc- 
promoter (-219 to +86 as referred to the translations! start site) or the SM -promoter (-202 
to -1 as referred to the translational start site). 
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24 h after bombardment the cells were stained for GUS activity. A clear increase in the 
number and intensity of blue spots was observed when the gusA-reporter gene was co- 
bombarded with 35S-Orca3, when compared to a negative control in which the cells were 
co-bombarded with the empty overexpression vector. This enhancement was observed 
5 both with the Tdc-promoter fragment (Fig. 1 1) as well as with the Sfrt-promoter fragment 
(Fig. 10) driving reporter gene expression. As a control, GUS activity from an unrelated 
promoter (6Tcyt) was not increased after co-bombardment with Orca-3 (data not shown). 
Thus in this transient transformation assay, Orca-3 activates gene expression from the 
Tdc- and Sth -promoter in a sequence-specific manner. 



10 



EXAMPLE 9 

ORCA-1. -2 and -3 activate expression nf the iasmona te-responsive RV element from the 
SfH -promoter 



15 



The transient co-bombardment assay, as described above, was used to investigate 
whether the ORCA proteins activate gene expression of the jasmonate-responsive RV 
element from the -promoter. C. roseus cells were bombarded with a gusA-reporter 
gene driven by a tetramer of the RV element (4RV) in combination with the empty 
20 overexpression plasmid, 35S-Orca1, 35S-Orca2 or 35S-Orca3. The number and intensity 
of b l u e s pot s was in c rea s ed cle a rly u pon co-bombardment with one of the three Orca 



genes, when compared to control cells, which were co-bombarded with the empty vector 
(Fig. 12). The strongest activation of gusA expression was observed when Orca-2 or Orca- 
3 were used. However, to make statements about the relative strengths of activation of the 
25 3 different ORCAs, quantitative analysis is necessary, thereby correcting for differences in 
particle coating and bombardment efficiencies. In addition, the differences in activation may 
be due to differences in expression levels of the three ORCA proteins, instead of 
differences in activation strength. 



30 



EXAMPLE 10 

ORCA-3 does not activate gene expression of RV mutants M? and M3. and weakly for 
mutants M4 and M5 
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35S-Orca3 was co-bombarded with the 4RV mutant series (described in Example 1) to 
determine which sequences in the RV element were important for the activation by Orca3. 
After co-bombardment cells were stained for GUS activity and blue staining was quantified 
visually by estimating the number and intensity of blue spots. The results are shown in the 
5 below Table 10.1. 
TABLE 10.1 

Relative GUS activities as determined visual ly 



Cells bombarded 
with 


Relative GUS staining 


4RV + pMOG 


2 


2 


2 


4RV + 35S-Orca3 


4 


4 


4 


4M2 + pMOG 


1 


1 


1 


4M2 + 35S-Orca3 


1.5 


1 


1 


4M3 + pMOG 


0.5 


0.5 


0.5 


4M3 + 35S-0/ta3 


0.5 


0.5 


1 


4M4 + pMOG 


0.5 


0.5 


0.5 


4M4 + 35S-Orca3 


1 


1 


1 


4M5 + pMOG 


0.5 


0.5 


0.5 


4M5 + 35S-Orca3 


1.5 


1.5 


2 


4M6 + pMOG 


3 


2.5 


2.5 


4M6 + 35S-Orca3 


4 


4 


4 



The above Table 10.1 shows the relative GUS activities as determined visually. The C. 
roseus cells were co-bombarded in triplicate with the gusA reporter gene driven by 4RV or 
a mutant of this element together with an overexpression plasmid carrying the Orca3 cDMA 
(35S-Orca3) or the empty vector (pMOG). Cells were stained for GUS activity. The 
10 numbers in Table 10.1 are an estimation of the intensity and number of blue spots. Values 
range from 0 (no blue spots) until 4 (many intense blue spots). 

No activation of GUS activity was observed when the gusA reporter gene was driven by a 
tetramer of RV mutant M2 or M3. For mutants M4 and M5 f the basal GUS activity was 

15 lower, when compared to the wild-type. However, ORCA-3 was able to activate gene 
expression of these mutants. Expression of mutant M6 resembled the wild-type, both in 
basal GUS activity levels, as well in inducibility by ORCA-3. Thus ORCA-3 is able to 
activate gene expression from the RV element and sequences that are disrupted in 
mutants 2 and 3 are important for this sequence specific activation. Again, quantitative 

20 analysis is necessary to make conclusive statements about the exact sequence 

requirements for activation of gene expression. The preliminary results indicate that the 
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sequence requirements for activation of gene expression in plant cells by ORCA-3 are the 
same as for ORCA-1 and 2. , 
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CLAIMS 

1. A method of modulating in a cell the expression of one or more genes involved in the 
biosynthesis of a metabolite or a precursor herefor, 

5 

the method comprising inserting into the ceil a nucleotide sequence coding for a 
transcription factor other than a transcription factor having a bHLH-type or a MYB-type 
DNA-binding domain, the nucleotide sequence is operably linked to at least one expression 
regulating sequence, and/or modifying the expression of a nucleotide sequence coding for 
1 0 such a transcription factor already present in the cell, the transcription factor is capable of 
regulating the expression of at least one gene coding for a gene product involved in the 
biosynthesis of said metabolite or precursor, and subjecting the cell to conditions where the 
inserted nucleotide sequence coding for a transcription factor is expressed or the 
expression of the nucleotide sequence already present in the cell is modulated. 

15 

2. A method according to claim 1 wherein the cell is a plant cell 

3. A method according to claim 2 wherein the*celtfs of a dicotyledonous species. 

20 4. A method according to claim 3 wherein the cell is of a species selected from the group 
consisting of the Gentianales order and the Cornates order. 

5. A method according to claim 3 wherein the cell is of a species selected from an indole 
alkaloid producing species. 

25 

6. A method according to claim 4 or 5 wherein the celt is a species selected from the group 
consisting of species belonging to the Apocynaceae, Afangiaceae, Loganiaceae, 
Icacinaceae, Comaceae and Rubiaceae families. 

30 7. A method according to claim 6 wherein the cell is of a species belonging to the genus 
Catharanthus, 

8. A method according to claim 7 wherein the cell is a Catharanthus roseus cell. 
35 9. A method according to claim 2 wherein the plant cell is in the form of a cell culture. 
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10. A method according to claim 2 wherein the plant cell is in a plant tissue. 

11. A method according to claim 1 wherein the nucleotide sequence coding for a 
5 transcription factor is derived from a plant cell. 

12. A method according to claim 1 1 wherein the cell is derived from a dicotyledonous 
species. 

10 13. A method according to claim 12 wherein the cell is derived from a species selected 
from the group consisting of species of the Gentianales order and species of the Cornales 
order. 

14. A method according to claim 12 wherein the cell is derived from an indole alkaloid 
15 producing species. 

15. A method according to claim 13 or 14 wherein the cell is derived from a species 
selected from the group consisting of species belonging to the Apocynaceae, Atangiaceae, 
Loganiaceae, Icacinaceae, Cornaceae and Rubiaceae families. 

20 

1 6 a mothoH Qf ^nrriin g to claim 15 wherein the cell is derived from a species belonging to 
the Catharanthus genus. 

17. A method according to claim 16 wherein the cell is derived from Catharanthus roseus. 

25 

18. A method according to claim 1 wherein the nucleotide sequence is a sequence coding 
for a transcription factor comprising at least one AP2 domain, including at least two AP2 
domains and at least three AP2 domains. 

30 19. A method according to claim 18 wherein the nucleotide sequence is a sequence 
comprising at least one sequence coding for an AP2 domain selected from the group 
consisting of AP2 domains of SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7. 
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20. A method according to claim 18 wherein the nucleotide sequence is a sequence coding 
for an amino acid sequence selected from the group consisting of SEQ ID NO:5 ( SEQ ID 
NO:6 and SEQ ID NO:7. 

5 21. A method according to claim 18 wherein the nucleotide sequence is a sequence coding 
for an amino acid sequence comprising an at least one AP2 domain having at least 40% 
homology, at least 50%, at least 60%, at least 70% t at least 80% or at least 90% homology 
to any of the at least one AP2 domains of SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7. 

10 22. A method according to claim 18 wherein the nucleotide sequence comprises at least 
one sequence coding for an AP2 domain of a sequence selected from the group consisting 
of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4. 

23. A method according to claim 18 wherein the nucleotide sequence coding for the at 

15 least one AP2 domain is a mutant, allele or variant of a nucleotide sequence coding for an 
AP2 domain of a sequence selected from the group consisting of SEQ ID NO:1 , SEQ ID 
NO:2, SEQ ID NO:3 and SEQ ID NO:4. 

24. A method according to claim 1 wherein the nucleotide sequence coding for a 
20 transcription factor is a homologous nucleotide sequence. 

25. A method according to claim 1 wherein the nucleotide sequence coding for a 
transcription factor is a heterologous nucleotide sequence. 

25 26. A method according to claim 24 or 25 wherein the nucleotide sequence is operably 
linked to a regulating nucleotide sequence with which it is not natively associated. 

27. A method according to claim 1 wherein the metabolite is a plant metabolite. 

30 28. A method according to claim 27 wherein the plant metabolite is a secondary metabolite. 

29. A method according to claim 28 wherein the secondary metabolite is selected from the 
group consisting of an alkaloid compound, a phenolic compound and a terpenoid 
compound. 
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30. A method according to claim 29 wherein the secondary metabolite is derived from a 
compound selected from the group consisting of tryptophan, tyrosine, lysine, ornithine, 
nicotinic acid, anthranilic acid and acetate. 

5 31 . A method according to claim 30 wherein the secondary metabolite is an indole alkaloid 
such as a terpenoid indole alkaloid. 

32. A method according to claim 1 wherein the gene product involved in metabolite 
production is a protein including an enzyme. 



10 



33. A method according to claim 32 wherein the enzyme is an enzyme involved in an 
alkaloid biosynthetic pathway. 

34. A method according to claim 33 wherein the enzyme is selected from the group 
15 consisting of tryptophan decarboxylase (TDC), geraniol-10-hydroxylase (G10H). 

strictosidine synthase (STR) and strictosidine glucosidase (SG). 

35. A method according to claim 1 wherein the expression in the cell of one or more genes 
involved in the biosynthesis of a metabolite or a precursor herefor is enhanced or reduced 

20 relative to a cell into which a nucleotide sequence coding for a transcription factor is not 

in^rtgri or which is without modified expression of the already present nucleotide 

sequence coding for a transcription factor. 

36. A method according to claim 35 wherein the expression in the cell of one or more 

25 genes involved in the biosynthesis of a metabolite or a precursor herefor is enhanced by at 
least 10%, at least 25% or at least 100%. relative to a cell into which a nucleotide 
sequence coding for a transcription factor is not inserted or which is without modified 
expression of the already present nucleotide sequence coding for a transcription factor. 

30 37. A method according to claim 35 wherein the expression in the cell of one or more 
genes involved in the biosynthesis of a metabolite or a precursor herefor is reduced by at 
least 10%, at least 25% or at least 100%, relative to a cell into which a nucleotide 
sequence coding for a transcription factor is not inserted or which is without modified 
expression of the already present nucleotide sequence coding for a transcription factor. 

35 
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38. A method of modulating the stress resistance of a cell, the method comprising 
inserting into the cell a nucleotide sequence coding for a transcription factor, the nucleotide 
s quence is operably linked to at least one expression regulating sequence, and/or 
modifying the expression of a nucleotide sequence coding for such a transcription factor 

5 already present in the cell, the transcription factor is capable of regulating the expression of 
at least one gene coding for a gene product involved in the biosynthesis of said metabolite 
or precursor, and subjecting the cell to conditions where the inserted nucleotide sequence 
coding for a transcription factor is expressed or the expression of the nucleotide sequence 
already present in the cell is modulated, the expression of said nucleotide sequence 
10 resulting in a modified responsiveness of the cell towards exogenous stress conditions. 

39. A method according to claim 38 wherein the ceil is a plant cell 

40. A method according to claim 39 wherein the cell is of a dicotyledonous species. 

15 

41. A method according to claim 40 wherein the cell is of a species selected from the group 
consisting of the Gentianafes order and the Comales order. 

42. A method according to claim 40 wherein the cell is of a species selected from an indole 
20 alkaloid producing species. 

43. A method according to claim 41 or 42 wherein the cell is a species selected from the 
group consisting of species belonging to the Apocynaceae, Alangiaceae, Loganiaceae, 
Icacinaceae, Comaceae and Rubiaceae families. 

25 

44. A method according to claim 43 wherein the cell is of a species belonging to the genus 
Catharanthus. 

45. A method according to claim 44 wherein the cell is a Catharanthus roseus ceil. 

30 

46. A method according to claim 39 wherein the plant cell is in the form of a celt culture. 

47. A method according to claim 39 wherein the plant cell is in a plant tissue. 
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48. A method according to claim 37 wherein the nucleotide sequence coding for a 
transcription factor is derived from a plant cell. 

49. A method according to claim 48 wherein the cell is derived from a dicotyledonous 
5 species. 

50. A method according to claim 49 wherein the cell is derived from a species selected 
from the group consisting of species of the Gentianales order and species of the Comales 
order. 

10 

51. A method according to claim 49 wherein the cell is derived from an indole alkaloid 
producing species. 

52. A method according to claim 50 or 51 wherein the cell is derived from a species 

15 selected from the group consisting of species belonging to the Apocynaceae, Alangiaceae, 
Loganiaceae, Icacinaceae, Cornaceae and Rubiaceae families. 

53. A method according to claim 52 wherein the cell is derived from a species belonging to 
the Catharanthus genus. 

20 

54. A method according to claim 53 wherein the cell is derived from Catharanthus roseus. 

55. A method according to claim 38 wherein the nucleotide sequence is a sequence coding 
for a transcription factor comprising at least one AP2 domain, including at least two AP2 

25 domains and at least three AP2 domains. 

56. A method according to claim 38 wherein the nucleotide sequence is a sequence 
comprising at least one sequence coding for an AP2 domain selected from the group 
consisting of AP2 domains of SEQ ID NO:5 f SEQ ID NO:6 and SEQ ID NO:7. 

30 

57. A method according to claim 38 wherein the nucleotide sequence is a sequence coding 
for an amino acid sequence selected from the group consisting of SEQ ID NO:5, SEQ ID 
NO:6 and SEQ ID NO:7. 
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58. A method according to claim 38 wherein the nucleotide sequence is a sequence coding 
for an amino acid sequence comprising at least one AP2 domain having at least 40% 
homology, including at least 50%, at least 60%, at least 70%, at least 80% or at least 90% 
homology to any of the at least one AP2 domains of SEQ ID NO:5, SEQ ID NO:6 and SEQ 

5 IDNO:7. 

59. A method according to claim 38 wherein the nucleotide sequence comprises at least 
one sequence coding for an AP2 domain of a sequence selected from the group consisting 
of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4. 

10 

60. A method according to claim 38 wherein the nucleotide sequence coding for the at 
least one AP2 domain is a mutant, allele or variant of a nucleotide sequence coding for an 
AP2 domain of a sequence selected from the group consisting of SEQ ID NO:1, SEQ ID 
NO:2, SEQ ID NO:3 and SEQ ID NO:4. 

15 

61. A method according to claim 38 wherein the nucleotide sequence coding for a 
transcription factor is a homologous nucleotide sequence. 

62. A method according to claim 38 wherein the nucleotide sequence coding for a 
20 transcription factor is a heterologous nucleotide sequence. 



63. A method according to claim 61 or 62 wherein the nucleotide sequence is operably 
linked to a regulating nucleotide sequence with which it is not natively associated. 

25 64. A method according to claim 38 wherein the metabolite is a plant metabolite 

65. A method according to claim 64 wherein the plant metabolite is a secondary metabolite. 

66. A method according to claim 65 wherein the secondary metabolite is selected from the 
30 group consisting of an alkaloid compound, a phenolic compound and a terpenoid 

compound. 

67. A method according to claim 66 wherein the secondary metabolite is derived from a 
compound selected from the group consisting of tryptophan, tyrosine, lysine, ornithine, 

35 nicotinic acid, anthranilic acid and acetate. 
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67. A method according to claim 66 wherein the secondary metabolite is an indole alkaloid 
such as a terpenoid indole alkaloid. 

5 69. A method according to claim 38 wherein the gene product involved in metabolite 
production is a protein including an enzyme. 

70. A method according to claim 69 wherein the enzyme is an enzyme involved in an 
alkaloid biosynthetic pathway. 

10 

71 . A method according to claim 70 wherein the enzyme is selected from the group 
consisting of tryptophan decarboxylase (TDC), geraniol-10-hydroxylase (G10H), 
strictosidine synthase (STR) and strictosidine glucosidase (SG). 

15 72. A method according to claim 38 wherein the expression in the cell of one or more 
genes involved in the biosynthesis of a metabolite or a precursor herefor is enhanced or 
reduced relative to a cell into which a nucleotide sequence coding for a transcription factor 
is not inserted or which is without modified expression of the already present nucleotide 
sequence coding for a transcription factor. 

20 

73. A method according to claim 72 wherein the expression in the cell of one or more 
genes involved in the biosynthesis of a metabolite or a precursor herefor is enhanced by at 
least 10%. at least 25% or at least 100%, relative to a cell into which a nucleotide 
sequence coding for 3 transcription factor is not inserted or which is without modified 

25 expression of the already present nucleotide sequence coding for a transcription factor 

74. A method according to claim 72 wherein the expression in the cell of one or more 
genes involved in the biosynthesis of a metabolite or a precursor herefor is reduced by at 
least 10%, at least 25% or at least 100%, relative to a cell into which a nucleotide 

30 sequence coding for a transcription factor is not inserted or which is without modified 
expression of the already present nucleotide sequence coding for a transcription factor. 

75. A recombinant cell having, relative to its parent cell, enhanced or reduced biosynthesis 
of a metabolite or a precursor therefor, and/or enhanced or reduced expression of a gene 

35 product involved in metabolite production, the cell comprising a nucleotide sequence 
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coding for a transcription factor other than a transcription factor having a bHLH-type or a 
MYB-type DNA-binding domain, said transcription factor is capable of regulating the 
expression of at least one gene coding for a gene product involved in the metabolite 
production, said sequence is inserted into the cell and/or its expression is modified by 
operably linking it to a regulating* sequence with which it is not natively associated. 

76. A cell according to claim 75 which is a plant cell 

77. A cell according to claim 76 which is of a dicotyledonous species. 

78. A cell according to claim 77 wherein the cell is a cell of a species selected from the 
group consisting of the Gentianales order and the Cornales order. 

79. A cell according to claim 77 which is a cell of an indole alkaloid producing species. 

80. A cell according to claim 78 or 79 which is a cell of a species selected from the group 
consisting of species-belonging to the Apoaynaceae^AJangiaceae, Loganiaceae, 
Icacinaceae* Comaeeae and Rubiaceae families. 

81 . A cell according to claim 80 which is a cell of a species belonging to the genus 
Catharanthus. 



82. A cell according^© claim 81 which is a Catharanthus roseus cell. 
25 83. A cell according to claim 76 which is in the form of a cell culture. 

84. A cell according to claim 76 wherein the plant cell is in a plant tissue. 

85. A cell according to claim 75 wherein the nucleotide sequence coding for a transcription 
30 factor is derived from a plant cell. 

86. A cell according to claim 85 which is derived from a dicotyledonous species. 

87. A cell according to claim 86 which is derived from a species selected from the group 
35 consisting of species of the Gentianales order and species of the Cornales order. 
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88. A cell according to claim 86 which is derived from an indole alkaloid producing species. 

89. A cell according to claim 87 or 88 which is derived from a species selected from the 
5 group consisting of species belonging to the Apocynaceae, Alangiaceae, Loganiaceae, 

Icacinaceae, Cornaceae and Rubiaceae families. 

90. A cell according to claim 89 which is derived from a species belonging to the 
Catharanthus genus. 

10 

91 . A cell according to claim 90 which is derived from Catharanthus roseus. 

92. A cell according to claim 75 wherein the nucleotide sequence is a sequence coding for 
a transcription factor comprising at least one AP2 domain, including at least two AP2 

15 domains and at least three AP2 domains. 

93. A cell according to claim 85 wherein the nucleotide sequence is a sequence comprising 
at least one sequence coding for an AP2 domain selected from the group consisting of AP2 
domains of SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7. 

20 

94. A cell according to claim 93 wherein the nucleotide sequence is a sequence coding for 
an amino acid sequence selected from the group consisting of SEQ ID NO:5, SEQ ID NO:6 
and SEQ ID NO:7. 

25 95. A cell according to claim 93 wherein the nucleotide sequence is a sequence coding for 
an amino acid sequence comprising at least one AP2 domain having at least 40% 
homology, including at least 50%, at least 60%, at least 70%, at least 80% or at least 90% 
homology to any of the at least one AP2 domains of SEQ ID NO:5, SEQ ID NO:6 and SEQ 
ID NO:7. 

30 

96. A cell according to claim 75 wherein the nucleotide sequence comprises at least one 
sequence coding for an AP2 domain of a sequence selected from the group consisting of 
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4. 
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97. A cell according to claim 75 wherein the nucleotide sequence coding for the at least 
one AP2 domain is a mutant, allele or variant of a nucleotide sequence coding for an AP2 
domain of a sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, 
SEQ ID NO:3 and SEQ ID NO:4. 

5 

98. A cell according to claim 75 wherein the nucleotide sequence coding for a transcription 
factor is a homologous nucleotide sequence. 

99. A cell according to claim 75 wherein the nucleotide sequence coding for a transcription 
10 factor is a heterologous nucleotide sequence. 

100. A cell according to claim 98 or 99 wherein the nucleotide sequence is operably linked 
to a regulating nucleotide sequence with which it is not natively associated. 

15 101. A cell according to claim 75 wherein the metabolite is a plant metabolite 

102. A cell accoPdiFig^to claiiw 101 wherein the plant metabolite4s a secondary metabolite. 

103. A cell according to claim 102 wherein the secondary metabolite is selected from the 
20 group consisting of an alkaloid compound, a phenolic compound and a terpenoid 

compound. 

104. A method according to claim 103 wherein the secondary metabolite is derived from a 
compound selected from the group consisting of tryptophan, tyrosine, lysine, ornithine, 

25 nicotinic acid, anthranilic acid and acetate. 

105. A cell according to claim 103 wherein the secondary metabolite is an indole alkaloid 
such as a terpenoid indole alkaloid. 

30 106. A cell according to claim 75 wherein the gene product involved in metabolite 
production is a protein including an enzyme. 

107. A cell according to claim 106 wherein the enzyme is an enzyme involved in an 
alkaloid biosynthetic pathway. 

35 
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108. A cell according to claim 107 wherein the enzyme is selected from the group 
consisting of tryptophan decarboxylase (TDC), geraniol-1 0-hydroxylas (G1 OH), 
strictosidine synthase (STR) and strictosidine glucosidas (SG). 

5 109. A cell according to claim 75 where the expression in the cell of one or more genes 
involved in the biosynthesis of a metabolite or a precursor herefor is enhanced or reduced 
relative to a cell into which a nucleotide sequence coding for a transcription factor is not 
inserted or which is without modified expression of the already present nucleotide 
sequence coding for a transcription factor. 

10 

1 10. A cell according to claim 109 where the expression in the cell of one or more genes 
involved in the biosynthesis of a metabolite or a precursor herefor is enhanced by at least 
10%, at least 25% or at least 100%, relative to a cell into which a nucleotide sequence 
coding for a transcription factor is not inserted or which is without modified expression of 

1 5 the already present nucleotide sequence coding for a transcription factor. 

111. A cell according to claim 110 wherein at least one pathway leading to the biosynthesis 
of at least one precursor for the metabolite can be stimulated in said cell in the presence of 
an inducing agent. 

20 

112. A cell according to claim 109 wherein the expression in the cell of one or more genes 
involved in the biosynthesis of a metabolite or a precursor herefor is reduced by at least 
10%, at least 25% or at least 100%, relative to a cell into which a nucleotide sequence 
coding for a transcription factor is not inserted or which is without modified expression of 

25 the already present nucleotide sequence coding for a transcription factor. 

113. A method of producing a metabolite including a plant secondary metabolite, the 
method comprising providing a recombinant cell according to any of claims 75-1 12, 
cultivating said cell under conditions where the nucleotide sequence coding for the 

30 transcription factor regulating the expression of at least one gene coding for a gene product 
involved in the metabolite production is expressed, and recovering the metabolite. 

114. A method according to claim 113 wherein the cell is cultivated under conditions where 
at least one precursor for the metabolite is added in a form which can be assimilated by 

35 said cell. 
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1 15. A method of constructing a recombinant cell according to any of claims 75-1 12, said 
method comprising the steps of (i) identifying in a source cell a nucleotide sequence coding 
for a transcription factor other than a transcription factor having a bHLH-type or a MYB- 

5 type DNA-binding domain, said transcription factor is capable of regulating in the source 
cell the expression of at least one gene coding for a gene product involved in biosynthesis 
of a metabolite or a precursor therefor, (ii) isolating said nucleotide sequence and (iii) 
inserting said isolated nucleotide sequence into a host cell comprising a gene coding for a 
gene product involved in biosynthesis of a metabolite or a precursor therefor, the insertion 
10 of said nucleotide sequence resulting in that the expression of the gene of the host cell is 
modulated relative to the expression level in a host cell not comprising the inserted 
nucleotide sequence. 

1 16. A method according to claim 115 wherein at least two isolated nucleotide sequences 
1 5 are inserted into the host cell. 

1 17. An isolated nucleic acid molecule comprising a nucleotide sequence coding for an 
amino acid sequence selected from the group consisting of SEQ ID NO:5, SEQ ID NO:6 
and SEQ ID NO:7. 

20 

1 18. An isolated nucleic acid according to claim 117 comprising a nucleotide sequence 
which is selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 
and SEQ ID NO:4 and a mutant, allele or variant hereof. 

25 119. A polypeptide which includes an amino acid sequence having at least 40% homology 
including at least 50%, such as at least 60%, at least 70%, at least 80% and at least 90% 
to an amino acid sequence selected from the group consisting of SEQ ID NO:5, SEQ ID 
NO:6 and SEQ ID NO:7. 

30 120. A polypeptide having the amino acid sequence shown in SEQ ID NO:5, SEQ ID NO:6 
or SEQ ID NO:7. 

121. A vector comprising a nucleic acid molecule according to claim 117 or 118. 
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122. A vector according to claim 117 comprising at least one further sequence coding for a 
transcription factor. 

123. A host cell comprising a vector according to claim 1 17 or 1 18. 

5 

124. A cell according to claim 123 which is selected from the group consisting of a 
bacterium, a fungal cell, a yeast cell, an animal cell, a unicellular eukaryotic cell, an algae 
cell and a plant cell. 

10 125. A method of isolating a nucleotide sequence coding for a transcription factor not 
having a bHLH-type or a MYB-type DNA-binding domain, said transcription factor is 
capable of regulating in a cell the expression of at least one gene coding for a gene 
product involved in biosynthesis of a metabolite or a precursor therefor, the method 
comprising the steps of 

15 (i) probing a cDNA or genomic library with a probe comprising at least a fragment of a 
nucleic acid according to claim 117 or 1 18; 

(ii) identifying a DNA clone that hybridises with said at least a fragment of a nucleic acid; 
and 

(iii) isolating the DNA clone identified in step (ii) wherein the nucleic acid sequence is 
20 coding for all or a part of said transcription factor. 



126. A method of isolating a nucleotide sequence coding for a transcription factor not 
having a bHLH-type or a MYB-type DNA-binding domain, said transcription factor is 
capable of regulating in a cell the expression of at least one gene coding for a gene 
25 product involved in biosynthesis of a metabolite or a precursor therefor, the method 
comprising the steps of 

(i) synthesising an oligonucleotide primer set corresponding to at least a fragment of a 
sequence selected from the group consisting of SEQ ID NO:1, 2, 3 and 4; and 

(ii) amplifying cDNA or genomic DNA using said primer set in a polymerase chain reaction 
30 wherein the amplified nucleic acid fragment is coding for all or a part of said transcription 

factor 

to obtain a nucleotide sequence coding for a transcription factor not having a bHLH-type or 
a MYB-type DNA-binding domain. 
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127. An isolated nucleic acid which is a nucleotide sequence as shown in SEQ ID NO: 8 
and a mutant, allele or variant hereof. 

128. Use of an isolated nucleic acid sequence according to claim 127 for isolating a 

5 nucleotide sequence coding for a transcription factor not having a bHLH-type or a MYB- 
type DNA-binding domain, said transcription factor is capable of regulating in a cell the 
expression of at least one gene coding for a gene product involved in biosynthesis of a 
metabolite or a precursor therefor, the method comprising the steps as essentially 
described in Example 2 

10 
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ABSTRACT 

A method for modulating in a cell the expression of one or more genes involved in the 
biosynthesis of a metabolite or a precursor for this metabolite by inserting into a cell a 
5 nucleotide sequence coding for a transcription factor comprising an AP2 DNA-binding 
domain, and/or by modifying the expression of a nucleotide sequence coding for such a 
transcription factor already present in the cell is provided. The method is useful for 
enhancing the biosynthesis of secondary metabolites in plants such as alkaloids including 
terpenoid indole alkaloids. Also provided is a method for enhancing stress tolerance in 
10 plants by the use of such transcription factors. 
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SEQUENCE LISTING 

<110> Johan Memelink 

Leslie van der Fits 
Frank L.H. Menke 
Jan W. Kijne 

<X2 0> METHOD OF MODULATING METABOLITE 
BIOSYNTHESIS IN RECOMBINANT CELLS 



<130> 22209XX1 
<160> 8 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 2298 
<212> DNA 

<213> Catharanthus roseus 
<400> 1 

atcaaatttc acacagtttt agaactctac gacctatttg ttactgaaaa ttactggaat 60 

tactaaaatc ggaagaagaa atcaacgcga cgaaagagaa aaagaacaaa agggtttcgt 12 0 

ttttgtaaag tttgattctt ggcggagatt ttcgacaaag gagtgggcaa tttgtgcaat 18 0 

acttctgaga aaattgaaag agatacaagg atggctcttc ttgatcaggc atccaatttg 240 

agtcccatgc cttttgattt cactagaaag aggaagtcga ggaggaggga tggtactaag 300 

aacgtagcgg agacacttgc aaagtggaaa gagtataatg agaaacttga tgctttagat 360 

ggagggaaac cagctcggaa ggttcctgcc aaaggatcaa aaaagggatg tatgaaaggt 42 0 

aaaggaggcc ctgagaattc tcactgcaaa tacagaggag ttaggcagag gacatggggt 480 

aaatgggtgg ccgaaattcg ggaaccaaac aggggtagca ggctttggtt gggtacattc 54 0 

agaaacgcga tagaagctgc acttgcttat gatgaagcag cgagggcgat gtatgggcct 600 

tgtgctaggc ttaatcttcc gaactatagg gcttcagaag aatcttcttc cttgccaaca 660 

acatcaggat cagatacgac tactgcttct ggcatctcag aggtctctgt ctatgaagac 720 

aaaaagttca caccagttgt ttccggattg aaacaagatg acaagggtga atcattagag 780 

tcagctgata gtaaacctca actcctggtc gatgctggca ctcccatgag tgcagtgaag 840 

gaagaaccaa aagaatatca ggttatggat tcccagtctg aagggcaatt cggagacgag 900 

gaaccgccta gcaagcttgt ttgtaaagaa gtcgactttg ggcaggatca agctgttgtfc 960 

cctgctgtta aaaatgctga ggagatgggt ggagagatgg gtggagatat actgaaaggc 1020 

tgttctttgt ctgagatgtt tgatgtggac gagttgttga gcgttttaga ttctacaccc 108 0 

ctccatgcct cagattttca gcatggcatg ggaaatggta atgtaaaggc agaggctgct 1140 

tacaactatg ctccttcatg ggactcggcc ttccagttgc agaatcaaga tcctaagcta 1200 

ggaagtcagc agcacatggc gcagacaccc ccagaaatta attccgggct tgattttttg 1260 

cagccaggaa gacaagagga ctcctatttt actttgggtg atctagactt tcttgatttg 132 0 

ggtgctgaat tgggattgta aatccgaagt tgttgaagct aaaagcggcg actatgaaac 1380 

tggaattttg gaacggctta ttgttcctgg tgtttgtctt agttctagtc tgtttatgta 144 0 

ctagaacttg acatatagga ggcttttgaa agctgaacaa acgaagtgtg aatfcattttc 1500 

tttttttgtt tttctgcagc gatgtatact aacatctcta ctactaaaat tacgtctctt 1560 

cgtcttcact aacagtaggg tggagctgat tctcttttaa gtttttcaga aggggaattc 1620 

agctatgagt ttagaggcag ggtagtgtag ttcagtgagc agattctttc tgtagatatc 1680 

tctagtcttt tggtttcttg gaatgttttt tctggtggaa taaaggaagc tgtgttcttg 1740 

tgggatttgg ttcttattag aatcacccct aatttgatca aacagctcca gaacgctcat 18 00 

tctcttccca atcagcctat gcatgaaatt cttctcttct ctctcccctc tcatggccga 18 60 

aaattgcaag agaaaaagag ccatctttag ctctttagtt tcttttcaat ttcttctctt 1920 

cttttctttg tttttcttcg tcggaagtac tcatgcgggt gagagaatac tccagaagat 1980 

catggagttt tgtggaagta ttaagcaatt tcaatggagg tctcaggaac aaatcgcgac 2040 

ttctccggtc tgcgcgtttc ttcctctccg gcaatgtcag gtgactcttg ttgggctcac 2100 

ggtggttagc gtcttatggg gaagctcacg atgaaaatgc atggcgagct gaagattttt 2160 

tttttttttt ttttttttga tgaaaatgct tggccagcaa aattctggcg agccaatttt 2220 

tgcttttctt tctttatatg aaatccttgg accatcaagc tgttcttttt tttaacaaca 2280 

taatgtgtta gccagcct 22 98 
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<210> 2 
<211> 1754 
<212> DNA 

<213> Catharanthus roseus 



<400> 2 

tgtaaatcaa atttcacaca gttttagaac tctacgacct atttgttact gaaaattact 60 

ggaattacta aaatcggaag aagaaatcaa cgcgacgaaa gagaaaaaga acaaaagggt 120 

ttcgtttttg taaagtttga ttcttggcgg agattttcga caaaggagtg ggcaatttgt 180 

gcaatacttc tgagaaaatt gaaagagata caaggatggc tcttcttgat caggcatcca 240 

atttgagtcc catgcctttt gatttcacta gaaagaggaa gtcgaggagg agggatggta 300 

ctaagaacgt agcggagaca cttgcaaagt ggaaagagta taatgagaaa cttgatgctt 3 60 

tagatggagg gaaaccagct cggaaggttc ctgccaaagg atcaaaaaag ggatgtatga 420 

aaggtaaagg aggccctgag aattctcact gcaaatacag aggagttagg cagaggacat 480 

ggggtaaatg ggtggccgaa attcgggaac caaacagggg tagcaggctt tggttgggta 540 

cattcagaaa cgcgatagaa gctgcacttg cttatgatga agcagcgagg gcgatgtatg 600 

ggccttgtgc taggcttaat cttccgaact atagggcttc agaagaatct tcttccttgc 660 

caacaacatc aggatcagat acgactactg cttctggcat ctcagaggtc tctgtctatg 720 

aagacaaaaa gttcacacca gttgtttccg gattgaaaca agatgacaag ggtgaatcat 780 

tagagtcagc tgatagtaaa cctcaactcc tggtcgatgc tggcactccc atgagtgcag 84 0 

tgaaggaaga accaaaagaa tatcaggtta tggattccca gtctgaaggg caattcggag 900 

acgaggaacc gcctagcaag cttgtttgta aagaagtcga ctttgggcag gatcaagctg 960 

ttgttcctgc tgttaaaaat gctgaggaga tgggtggaga gatgggtgga gatatactga 1020 

aaggctgttc tttgtctgag atgtttgatg tggacgagtt gttgagcgtt ttagattcta 10 80 

cacccctcca tgcctcagat tttcagcatg gcatgggaaa tggtaatgta aaggcagagg 114 0 

ctgcttacaa ctatgctcct tcatgggact cggccttcca gttgcagaat caagatccta 1200 

agctaggaag tcagcagcac atggcgcaga cacccccaga aattaattcc gggcttgatt 12 60 

ttttgcagcc aggaagacaa gaggactcct attttacttt gggtgatcta gactttcttg 1320 

atttgggtgc tgaattggga ttgtaaatcc gaagttgttg aagctaaaag cggcgactat 13 80 

gaaactggaa ttttggaacg gcttattgtt cctggtgttt gtcttagttc tagtctgttt 1440 

atgtactaga acttgacata taggaggctt ttgaaagctg aacaaacgaa gtgtgaatta 1500 

ttttcttttt ttgtttttct gcagcgatgt atactaacat ctctactact aaaattacgt 1560 

ctcttcgtct tcactaacag tagggtggag ctgattctct tttaagtttt tcagaagggg 1620 

aattcagcta tgagtttaga ggcagggtag tgtagttcag tgagcagatt ctttctgtag 1680 

atatctctag tcttttggtt tcttggaatg ttttttctgg tggaataaag atggcatagg 1740 

tggaggttgt atct 1754 

<210> 3 _____ 

<211> 885 ~~~ 

<212> DNA 

<213> Catharanthus roseus 



<400> 3 

caacaataat gtatcaatca aatgcccata 
tagtagatta tcaattcctc aacaacgatt 
attacgctaa ttataattat aatacttcta 
atgaaaattg cgaagaaatt atttcaccaa 
ttttaacaga tattttcaag gatcaggata 
aagaagaaga attaattacg acacctacct 
agagatcgaa tgaggaatgg attaggtacc 
tcgctgcgga aatcagggat cccaagagaa 
agacggcgga agatgcggca ttagctttcg 
gagctaggtt aaattttccc aatcttattg 
ctagacgccg atcttcatcg tgtcatcttc 
gggatagtaa attttttctt tgagtttttt 
aaacattgca aatatttttt tagtacgtct 
tgagcatggg gaaggtgata ttatccattg 
taatatgtac gaattacaag taaaatatag 

<210> 4 
<211> 792 
<212> DNA 

<213> Catharanthus roseus 



attccgatca tctaaccttc ttaccacctt 60 

ttgatttttc agaaatattt acagatttca 120 

cctcagataa tttctctggt tttcaattca 180 

attatgcttc ggaagattta tcggatatta 24 0 

attacgaaga cgaagtcgtt gcgggagaac 3 00 

ctcgcggcgg cggcggcggc ggatgtgagc 3 60 

gtggcgttag acggcggcca tgggggaaat 420 

aaggatcgag gatatggttg ggaacttacg 480 

atcaagcggc gtttcaactc cgtggttcta 540 

gttctgctaa tgctccggtt agagtaagtc 600 

gtcctcaata atcctatcca cagttccatg 660 

agaagttata ttatctattg aaaaaataca 720 

ctatacttct ttttagtaat attcggatca 780 

tcataaatta atagatacag tatcataaat 840 

taagtgttaa tattg 88 5 
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<400> 4 

ttctaaaaaa gaagaaaaat gtccgaagaa atcatttccg tctcagatcg atttcttctt 60 

tccttaatcg aagaacatct tctcagcgat aattctgatg attccagctc ggaattgact 12 0 

tctacagagg aaaattggga agaaattttt gcagattttc taaattggtc gggatccgaa 18 0 

atacagaaac gcggtagccc gagttccgaa agctgtcaat cgaattcaat ggcggaaagc 240 

tgtcaggagg attctgttgt gggaaccccg ccagaagcgg cggccggagg aggttgttcg 300 

aaggattgga accggtataa gggcgttaga cggcggccgt gggggaagtt cgcggcggag 3 60 

ataagggatc cgaaaaagaa aggatccagg atttggttgg gtacatacga gacacctgag 420 

gatgcagcat tggcttatga tgcagccgcg tttaatatgc gtggagctaa agctaggctt 480 

aattttcctc atttgattgg ttcgaatatt tccggacccg ttagagtaaa cccgagaaaa 540 

cgtttccctg cggagccttc tacgacgtcg tcgtcttctt cttcttcttc gtctgaaaafc 600 

agtggaggaa ggaagaagag acgatattaa ttaattatta aaagtggagg attaaaaaaa 660 

ttctgtgaaa tgagagatta ttacgtggtt tttgttaagc ccgataatcc ctcattgtaa 720 

aattattaac ttcatcgatg ttctttttta aatctttgga atgtacaaaa ttttatatcc 78 0 

aaaaaagttc ac 792 

<210> 5 
<211> 376 
<212> PRT 

<213> Catharanthus roseus 



<400> 5 
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